OBG3 globular head and uses thereof for decreasing body mass

ABSTRACT

The present invention relates to the field of obesity research. Obesity is a public health problem that is serious and widespread. A compound, globular OBG3, has been identified that reduces weight gain in animals. This compound should be effective for reducing body mass and for treating obesity-related diseases and disorders. These obesity-related diseases and disorders include hyperlipidemias, atherosclerosis, diabetes, and hypertension.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation-in-part of U.S. Ser. No. 09/776,976, filed Feb. 5, 2001 which is a continuation-in-part of U.S. Ser. No. 09/758,055, filed Jan. 10, 2001 which claims priority under 35 U.S.C. §119(e) to U.S. Provisional Patent Application Serial No. 60/176,228 filed Jan. 14, 2000; U.S. Provisional Patent Application Serial No 60/198,087, filed Apr. 13, 2000; and U.S. Provisional Patent Application Serial No. 60/299,881 filed Sep. 1, 2000, all of which are hereby incorporated by reference herein in their entireties, including any figures, tables, nucleic acid sequences, amino acid sequences, or drawings.

FIELD OF THE INVENTION

The present invention relates to the field of metabolic research, in particular the discovery of compounds effective for reducing body mass and useful for treating obesity-related diseases and disorders. The obesity-related diseases or disorders envisioned to be treated by the methods of the invention include, but are not limited to, hyperlipidemia, atherosclerosis, diabetes, and hypertension.

BACKGROUND OF THE INVENTION

The following discussion is intended to facilitate the understanding of the invention, but is not intended nor admitted to be prior art to the invention.

Obesity is a public health problem that is serious, widespread, and increasing. In the United States, 20 percent of the population is obese; in Europe, a slightly lower percentage is obese [Friedman (2000) Nature 404:632-634]. Obesity is associated with increased risk of hypertension, cardiovascular disease, diabetes, and cancer as well as respiratory complications and osteoarthritis [Kopelman (2000) Nature 404:635-643]. Even modest weight loss ameliorates these associated conditions.

While still acknowledging that lifestyle factors including environment, diet, age and exercise play a role in obesity, twin studies, analyses of familial aggregation, and adoption studies all indicate that obesity is largely the result of genetic factors (Barsh et al. (2000) Nature 404:644-651). In agreement with these studies, is the fact that an increasing number of obesity-related genes are being identified. Some of the more extensively studied genes include those encoding leptin (ob) and its receptor (db), pro-opiomelanocortin (Pomc), melanocortin-4-receptor (Mc4r), agouti protein (A^(y)), carboxypeptidase E (fat), 5-hydroxytryptamine receptor 2C (Htr2c), nescient basic helix-loop-helix 2 (Nhlh2), prohormone convertase 1 IPCSK1), and tubby protein (tubby) (reveiwed in Barsh et al. (2000) Nature 404:644-651).

SUMMARY OF THE INVENTION

The instant invention is based on the discovery that portions of the full length OBG3 polypeptide, termed OBG3 polypeptide fragments or gOBG3 polypeptide fragments, have unexpected effects in vitro and in vivo, including utility for reduction of body weight, prevention of weight gain, reduction blood triglyceride levels, control of blood lipid levels and control of blood glucose levels in humans and other mammals. These unexpected effects of OBG3 or gOBG3 polypeptide fragment administration in mammals also include a reduction of an elevated level of blood free fatty acids, particularly when the elevation is caused an increase in epinephrine levels, i.v. injection of “intralipid”, or intake of a high fat test meal, as well as an increased fatty acid oxidation, particularly in muscle cells, and weight reduction in mammals, particularly in mammals consuming a high fat/high sucrose diet. These effects are unexpected and surprising given that administration of full-length OBG3 polypeptide typically has no effect or a significantly reduced effect in vivo or in vitro depending upon the specific biological activity. To the extent that any effect is observed following administration of full-length OBG3 polypeptide, the levels of full-length OBG3 polypeptide required for an effect render it unfeasible in most instances as a potential treatment for humans at this time. In contrast, the OBG3 and gOBG3 polypeptide fragments of the invention are radically more effective and thus can be provided at levels that are feasible for treatments in humans.

Thus, the invention is drawn to OBG3 and gOBG3 polypeptide fragments, polynucleotides encoding said OBG3 and gOBG3 polypeptide fragments, vectors comprising said OBG3 and gOBG3 polynucleotides, and cells recombinant for said OBG3 and gOBG3 polynucleotides, as well as to pharmaceutical and physiologically acceptable compositions comprising said OBG3 and gOBG3 polypeptide fragments and methods of administering said OBG3 and gOBG3 pharmaceutical and physiologically acceptable compositions in order to reduce body weight or to treat obesity-related diseases and disorders. Assays for identifying agonists and antagonists of obesity-related activity are also part of the invention.

In a first aspect, the invention features a purified, isolated, or recombinant OBG3 or gOBG3 polypeptide fragment that has significantly greater activity than a full-length OBG3 polypeptide, wherein said activity is selected from the group consisting of lipid partitioning, lipid metabolism, and insulin-like activity. In preferred embodiments, said polypeptide fragment comprises, consists essentially of, or consists of, at least 6 and not more than 238 consecutive amino acids of SEQ ID NO:6 or at least 6 and not more than 241 consecutive amino acids of SEQ ID NO:2 or SEQ ID NO:4. In other preferred embodiments, said polypeptide fragment comprises an amino acid sequence at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the corresponding consecutive amino acids of SEQ ID NO:6, SEQ ID NO:2 or SEQ ID NO:4.

In other highly preferred embodiments, said polypeptide fragment comprises, consists essentially of, or consists of, a purified, isolated, or recombinant gOBG3 fragment. Preferably, said gOBG3 polypeptide fragment comprises, consists essentially of, or consists of, at least 6 consecutive amino acids of amino acids 88 to 244 of SEQ ID NO:6 or at least 6 consecutive amino acids of amino acids 91 to 247 of SEQ ID NO:2 or SEQ ID NO:4. Alternatively, said gOBG3 fragment comprises, consists essentially of, or consists of, an amino acid sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the corresponding amino acids 88 to 244 of SEQ ID NO:6 or at least 75% identical to amino acids 91 to 247 of SEQ ID NO:2 or SEQ ID NO:4. In a further preferred embodiment, the OBG3 polypeptide fragment is identical to an apM1 proteolytic cleavage product from human plasma. Preferably, the proteolytic cleavage product comprises the C1q globular head or a portion thereof. More preferably, the proteolytic cleavage product is identical to a proteolytic cleavage product isolated from human plasma by immunoprecipitation using antibodies specific for the C1q globular head. More preferably the proteolytic cleavage product cannot be immunoprecipitated from human plasma using an antibody directed against the human non-homologous region (HDQETTTQGPGVLLPLPKGA) of apM1. Still more preferably, the apM1. proteolytic cleavage product has an apparent molecular weight of 27 kDa using SDS-PAGE.

In a further preferred embodiment, the OBG3 or gOBG3 polypeptide fragment is able to lower circulating (either blood, serum or plasma) levels (concentration) of: (i) free fatty acids, (ii) glucose, (iii) triglycerides, and/or cholesterol. Further preferred polypeptide fragments demonstrating free fatty acid level lowering activity, glucose level lowering activity, and/or triglyceride level lowering activity, have an activity that is significantly greater than full length OBG3 at the same molar concentration, have a greater than transient activity and/or have a sustained activity.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that significantly stimulate muscle lipid or free fatty acid oxidation as compared to full length OBG3 polypeptides at the same molar concentration. Further preferred OBG3 or gOBG3 polypeptide fragments are those that cause C2C12 cells differentiated in the presence of said fragments to undergo at least 10%, 20%, 30%, 35%, or 40% more oleate oxidation as compared to untreated cells or cells treated with full length OBG3.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that are at least 30% more efficient than full length OBG3 at increasing leptin uptake in a liver cell line preferably BPRCL mouse liver cells (ATCC CRL-2217)).

Further preferred OBG3 or gOBG3 polypeptide fragments are those that significantly reduce the postprandial increase in plasma free fatty acids, particularly following a high fat meal.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that significantly reduce or eliminate ketone body production, particularly following a high fat meal.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase glucose uptake in skeletal muscle cells.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase fatty acid uptake in skeletal muscle cells.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase glucose uptake in adipose cells.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase glucose uptake in neuronal cells.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase glucose uptake in red blood cells.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that increase glucose uptake in the brain.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that significantly reduce the postprandial increase in plasma glucose following a meal, particularly a high carbohydrate meal.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that significantly prevent the postprandial increase in plasma glucose following a meal, particularly a high fat or a high carbohydrate meal.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that improve insulin sensitivity.

Further preferred OBG3 or gOBG3 polypeptide fragments are those that form multimers (e.g., heteromultimers or homomultimers) in vitro and/or in vivo. Preferred multimers are homodimers or homotrimers. Other preferred multimers are homomultimers comprising at least 4, 6, 8, 9, 10 or 12 OBG3 or gOBG3 polypeptide fragment subunits. Other preferred mulimers are hetero multimers comprising a OBG3 or gOBG3 polypeptide fragment of the invention.

Further preferred embodiments include heterologous polypeptides comprising an OBG3 or gOBG3 polypeptide fragment of the invention.

In a second aspect, the invention features a purified, isolated, or recombinant polynucleotide encoding said OBG3 polypeptide fragment described in the first aspect, or the complement thereof. In further embodiments the polynucleotides are DNA, RNA, DNA/RNA hybrids, single-stranded, and double-stranded.

In a third aspect, the invention features a recombinant vector comprising, consisting essentially of, or consisting of, said polynucleotide described in the second aspect.

In a fourth aspect, the invention features a recombinant cell comprising, consisting essentially of, or consisting of, said recombinant vector described in the third aspect. A further embodiment includes a host cell recombinant for a polynucleotide of the invention.

In a fifth aspect, the invention features a pharmaceutical or physiologically acceptable composition comprising, consisting essentially of, or consisting of, said OBG3 or gOBG3 polypeptide fragment described in the first aspect and, alternatively, a pharmaceutical or physiologically acceptable diluent.

In a sixth aspect, the invention features a method of reducing body mass comprising providing or administering to individuals in need of reducing body mass said pharmaceutical or physiologically acceptable composition described in the fifth aspect.

In preferred embodiments, the identification of said individuals in need of reducing body mass to be treated with said pharmaceutical or physiologically acceptable composition comprises genotyping OBG3 single nucleotide polymorphisms (SNPs) or measuring OBG3 or gOBG3 polypeptide or mRNA levels in clinical samples from said individuals. Preferably, said clinical samples are selected from the group consisting of plasma, urine, and saliva. Preferably, an OBG3 or gOBG3 polypeptide fragment of the present invention is administered to an individual with at least a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% reduction in blood, serum or plasma levels of full length OBG3 or the naturally proteolytically cleaved OBG3 fragment as compared to healthy, non-obese patients.

In a seventh aspect, the invention features a method of preventing or treating an obesity-related disease or disorder comprising providing or administering to an individual in need of such treatment said pharmaceutical or physiologically acceptable composition described in the fifth aspect. In preferred embodiments, the identification of said individuals in need of such treatment to be treated with said pharmaceutical or physiologically acceptable composition comprises genotyping OBG3 single nucleotide polymorphisms (SNPs) or measuring OBG3 or gOBG3 polypeptide or mRNA levels in clinical samples from said individuals. Preferably, said clinical samples are selected from the group consisting of blood, serum, plasma, urine, and saliva. Preferably, said obesity-related disease or disorder is selected from the group consisting of obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. In preferred embodiments, said individual is a mammal, preferably a human.

In related aspects, embodiments of the present invention includes methods of causing or inducing a desired biological response in an individual comprising the steps of: providing or administering to an individual a composition comprising an OBG3 or gOBG3 polypeptide fragment, wherein said biological response is selected from the group consisting of:

a) lowering circulating (either blood, serum, or plasma) levels (concentration) of free fatty acids;

b) lowering circulating (either blood, serum or plasma) levels (concentration) of glucose;

c) lowering circulating (either blood, serum or plasma) levels (concentration) of triglycerides;

d) lowering circulating (either blood, serum or plasma) levels (concentration) of cholesterol;

e) stimulating muscle lipid or free fatty acid oxidation;

f) increasing leptin uptake in the liver or liver cells;

g) reducing the postprandial increase in plasma free fatty acids, particularly following a high fat meal; and,

h) reducing or eliminating ketone body production, particularly following a high fat meal;

i) increasing tissue sensitivity to insulin, particularly muscle, adipose, liver or brain,

j) and further wherein said biological response is significantly greater than, or at least 10%, 20%, 30%, 35%, or 40% greater than, the biological response caused or induced by a full length OBG3 polypeptide at the same molar concentration; or alternatively wherein said biological response is greater than a transient response; or alternativley wherein said biological response is sustained. In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some persons with Insulin Dependent Diabetes Mellitus (IDDM, Type I diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some persons with Insulin Dependent Diabetes Mellitus (IDDM, Type I diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) alone, without combination of insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some persons with Insulin Dependent Diabetes Mellitus (IDDM, Type I diabetes) alone, without combination of insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) alone, without combination of insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some persons with Insulin Dependent Diabetes Mellitus (IDDM, Type I diabetes) alone, without combination of insulin therapy.

In a further preferred embodiment, the present invention may be used in complementary therapy of NIDDM patients to improve their weight or glucose control in combination with an insulin secretagogue or an insulin sensitising agent. Preferably, the insulin secretagogue is 1,1-dimethyl-2-(2-morpholino phenyl)guanidine fumarate (BTS67582) or a sulphonylurea selected from tolbutamide, tolazamide, chlorpropamide, glibenclamide, glimepiride, glipizide and glidazide. Preferably, the insulin sensitising agent is selected from metformin, ciglitazone, troglitazone and pioglitazone.

The present invention further provides a method of improving the body weight or glucose control of NIDDM patients alone, without an insulin secretagogue or an insulin sensitising agent.

In a further preferred embodiment, the present invention may be used in complementary therapy of IDDM patients to improve their weight or glucose control in combination with an insulin secretagogue or an insulin sensitising agent. Preferably, the insulin secretagogue is 1,1-dimethyl-2-(2-morpholino phenyl)guanidine fumarate (BTS67582) or a sulphonylurea selected from tolbutamide, tolazamide, chlorpropamide, glibenclamide, glimepiride, glipizide and glidazide. Preferably, the insulin sensitising agent is selected from metformin, ciglitazone, troglitazone and pioglitazone.

The present invention further provides a method of improving the body weight or glucose control of IDDM patients alone, without an insulin secretagogue or an insulin sensitising agent.

In a further preferred embodiment, the present invention may be administered either concomitantly or concurrently, with an insulin secretagogue or an insulin sensitising agent for example in the form of separate dosage units to be used simultaneously, separately or sequentially (either before or after the secretagogue or either before or after the sensitising agent). Accordingly, the present invention further provides for a composition of pharmaceutical or physiologically acceptable composition and an insulin secretagogue or an insulin sensitising agent as a combined preparation for simultaneous, separate or sequential use for the improvement of body weight or glucose control in NIDDM or IDDM patients.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition further provides a method for the use as an insulin sensitiser.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to improve insulin sensitivity in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to improve insulin sensitivity in some persons with Insulin Dependent Diabetes Mellitus (IDDM, Type I diabetes) in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to improve insulin sensitivity in some persons with Noninsulin Dependent Diabetes Mellitus (NIDDM, Type II diabetes) without insulin therapy.

In an eighth aspect, the invention features a method of making the OBG3 polypeptide fragment described in the first aspect, wherein said method is selected from the group consisting of: proteolytic cleavage, recombinant methodology and artificial synthesis.

In a ninth aspect, the present invention provides a method of making a recombinant OBG3 or gOBG3 polypeptide fragment or a full-length OBG3 polypeptide, the method comprising providing a transgenic, non-human mammal whose milk contains said recombinant OBG3 or gOBG3 polypeptide fragment or full-length protein, and purifying said recombinant OBG3 or gOBG3 polypeptide fragment or said full-length OBG3 polypeptide from the milk of said non-human mammal. In one embodiment, said non-human mammal is a cow, goat, sheep, rabbit, or mouse. In another embodiment, the method comprises purifying a recombinant full-length OBG3 polypeptide from said milk, and further comprises cleaving said protein in vitro to obtain a desired OBG3 or gOBG3 polypeptide fragment.

In a tenth aspect, the invention features a use of the polypeptide described in the first aspect for treatment of obesity-related diseases and disorders and/or reducing or increasing body mass. Preferably, said obesity-related diseases and disorders are selected from the group consisting of obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia.

In an eleventh aspect, the invention features a use of the polypeptide described in the first aspect for the preparation of a medicament for the treatment of obesity-related diseases and disorders and/or for reducing body mass. Preferably, said obesity-related disease or disorder is selected from the group consisting of obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. In preferred embodiments, said individual is a mammal, preferably a human.

In a twelfth aspect, the invention provides a polypeptide of the first aspect of the invention, or a composition of the fifth aspect of the invention, for use in a method of treatment of the human or animal body.

In a thirteenth aspect, the invention features methods of reducing body weight comprising providing to an individual said pharmaceutical or physiologically acceptable composition described in the fifth aspect, or the polypeptide described in the first aspect. Where the reduction of body weight is practiced for cosmetic purposes, the individual has a BMI of at least 20 and no more than 25. In embodiments for the treatment of obesity, the individual may have a BMI of at least 20. One embodiment for the treatment of obesity provides for the treatment of individuals with BMI values of at least 25. Another embodiment for the treatment of obesity provides for the treatment of individuals with BMI values of at least 30. Yet another embodiment provides for the treatment of individuals with BMI values of at least 40. Alternatively, for increasing the body weight of an individual, the BMI value should be at least 15 and no more than 20.

In a fourteenth aspect, the invention features the pharmaceutical or physiologically acceptable composition described in the fifth aspect for reducing body mass and/or for treatment or prevention of obesity-related diseases or disorders. Preferably, said obesity-related disease or disorder is selected from the group consisting of obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. In preferred embodiments, said individual is a mammal, preferably a human. In preferred embodiments, the identification of said individuals to be treated with said pharmaceutical or physiologically acceptable composition comprises genotyping OBG3 single nucleotide polymorphisms (SNPs) or measuring OBG3 or gOBG3 polypeptide or mRNA levels in clinical samples from said individuals. Preferably, said clinical samples are selected from the group consisting of blood, serum, plasma, urine, and saliva.

In a fifteenth aspect, the invention features the pharmaceutical or physiologically acceptable composition described in the fifth aspect for reducing body weight for cosmetic reasons.

In a sixteenth aspect, the OBG3 or OBG3 polypeptide fragments are the invention features methods treating insulin resistance comprising providing to an individual said pharmaceutical or physiologically acceptable composition described in the fifth aspect, or the polypeptide described in the first aspect.

In a preferred aspect of the methods above and disclosed herein, the amount of OBG3 or gOBG3 polypeptide fragment or polynucleotide administered to an individual is sufficient to bring circulating (blood, serum, or plasma) levels (concentration) of OBG3 polypeptides to their normal levels (levels in non-obese individuals). “Normal levels” may be specified as the total concentration of all circulating OBG3 polypeptides (full length OBG3 and fragments thereof) or the concentration of all circulating proteolytically cleaved OBG3 polypeptides only.

Full length OBG3 (Acrp30, AdipoQ, apM1) polypeptides and polynucleotides encoding the same may be specifically substituted for an OBG3 or gOBG3 polypeptide fragment or polynucleotide encoding the same in any embodiment of the present invention.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows an alignment of the sequences of the human (apM1), and mouse (AdipoQ and Acrp30) OBG3 polypeptides.

FIG. 2 shows the nucleic acid sequence of AdipoQ cloned into the BamHI and XhoI sites of pTrcHisB. AdipoQ begins at 510 and ends at 1214 (insert in bold). This construct does not contain the N-term signal sequence (MLLLQALLFLLILP).

FIG. 3 shows a schematic drawing of the protein structure of apM1. The putative signal sequence at the N-terminus (AA 1-17), the unique region (AA 18-41), the collagen region (AA 42-107), and the globular region (AA 108-244) at the carboxy terminus are shown. Two protease cleavage sites after AA 100 and AA 131 are also shown.

FIG. 4 shows the nucleic acid sequence of the globular region of AdipoQ cloned into pTrcHisB. AdipoQ globular region begins at 510 and ends at 927 bp. The insert is in bold.

FIG. 5 is a graph showing a comparison of the effect of AdipoQ (AQ) and AdipoQ globular head (AQ-GH) on cell-associated ¹²⁵I-leptin in the mouse liver cell line BPRCL. Results are shown as percent of control values in the presence of increasing amounts of compound (AQ or AQ-GH), and are the mean of triplicate determinations.

FIGS. 6A, 6B, and 6C show graphs of ¹²⁵I-LDL binding, uptake, and degradation, respectively, in the mouse liver cell line BPRCL in the presence of increasing amounts of gOBG3.

FIG. 7 shows a protein sequence alignment of the OBG3 clone (OBG3 clone; the insert in FIG. 2) with the published sequences of human (apM1) and mouse (AdipoQ and Acrp30) OBG3. In the alignment, amino acids (AAs) 45 to 110 contain the collagen-like region; AAs 111-247 contain the globular region. The cut sites from lysine-blocked trypsin fall after AAs 58, 61, 95, 103, 115, 125, and 134. As determined by amino-terminal sequencing of the gOBG3 product, the gOBG3 start site is at AA 104 (101 for human gOBG3 or apM1).

FIG. 8 shows a graphical representation of the effect of gOBG3 (3×25 μg ip) on plasma FFA in C57BL6/J mice following a high fat meal (*p<0.02).

FIGS. 9A and 9B show graphical representations of the effect of gOBG3 (3×25 μg ip) on plasma TG in C57BL6/J mice following a high fat meal (p<0.05 at 2, 3 and 4 hours). FIG. 9A shows TG in mg/dl; FIG. 9B shows TG as a percent of the starting value.

FIG. 10 shows a graphical representation of the effect of gOBG3 (3×25 μg ip) on plasma glucose in C57BL6/J mice following a high fat meal.

FIGS. 11A and 11B show graphical representations of the effect of gOBG3 (3×25 μg ip) on plasma FFA in C57BL6/J mice following a high fat meal. FIG. 11A shows FFA as mM; FIG. 11B shows FFA as a percent of the starting value.

FIGS. 12A and 12B show graphical representations of the effect of gOBG3 (3×25 μg) on plasma leptin in C57BL6/J mice following a high fat meal. FIG. 12A shows leptin as ng/mL; FIG. 12B shows leptin as a percent of the starting value.

FIGS. 13A and 13B show graphical representations of the effect of gOBG3 (3×25 μg) on plasma Insulin in C57BL6/J mice following a high fat meal. FIG. 13A shows insulin levels in ng/mL; FIG. 13B shows insulin as a percent of the starting value.

FIGS. 14A and 14B show graphical representations of the effect of OBG3 on plasma FFA in C57BL6/J mice following a high fat meal. At t=2 hours a significant reduction in FFA was seen for both treatment groups (p<0.05). FIG. 14A shows FFA levels in mM; FIG. 14B shows FFA as a percent of the starting value.

FIGS. 15A and 15B show graphical representations of the effect of OBG3 on plasma TG in C57BL6/J mice following a high fat meal. FIG. 15A shows TG levels in mg/dl; FIG. 15B shows TG as a percent of the starting value.

FIGS. 16A and 16B show graphical representations of the effect of OBG3 on plasma glucose in C57BL6/J mice following a high fat meal. FIG. 16A shows glucose levels as mg/dl; FIG. 16B shows glucose levels as a percent of the starting value.

FIG. 17 shows a table identifying additional apM1. SNPs. Information concerning Known Base Changes, Location, Prior Markers, Amplicon, and Forward and Reverse primers for microsequencing are shown.

FIGS. 18A and 18B show graphical representations of the effect of gAcrp30 injection in mice on the FFA (FIG. 18A) and glucose (FIG. 18B) increases resulting from epinephrine injection.

FIG. 19 shows a graphical representation of the effect of gAcrp30 treatment on fatty acid metabolism in muscle isolated from mice.

FIGS. 20A and 20B show a graphical representation of the effect of gAcrp30 treatment on triglyceride content of muscle and liver isolated from mice.

FIGS. 21A, 21B, 21C, & 21D show graphical representations of the effect of gAcrp30 treatment on weight gain & loss in mice. Treatments shown are saline (diamond), Acrp30 (Box), and gAcrp30 (triangle). FIG. 21A shows results of treatment of mice after 19 days on a high fat diet. FIG. 21B shows results of treatment of mice after 6 months on a high fat diet.

FIG. 22 shows a table of the tested blood chemistry values with saline injections, Acrp30 injections, or gAcrp30 injections.

FIGS. 23A and 23B show a SDS-PAGE separation of the purification of Acrp30 and gAcrp30 (23A) and a cleavage product of apM1. (23B). FIG. 23A, Lane II shows the complete form of Acrp30 purified by FPLC. Lane I shows the proteolytic cleavage product gAcrp30. FIG. 23B shows a cleavage product of apM1. after immunoprecipitation followed by Western blotting. The apparent molecular weight of this truncated form is 27 kDa, corresponding to about 70% of the complete form of apM1. (Lane IV). This truncated form was not detectable when a second anti-serum, specific for the human non-homologous region (HDQETTTQGPGVLLPLPKGA) of the protein was used for immunoprecipitation (Lane V) and the same anti-globular head antiserum for detection. A preimmune serum of the same animal did not detect any protein; a dimer of apM1. was seen with both specific antibodies (apparent MW 74 kDa).

FIG. 24 shows a graph depicting the removal of plasma FFAs after Intralipid injection following treatment with gAcrp30 (diamonds) or a saline control (squares).

DETAILED DISCLOSURE OF THE INVENTION

Before describing the invention in greater detail, the following definitions are set forth to illustrate and define the meaning and scope of the terms used to describe the invention herein.

As used interchangeably herein, the terms “oligonucleotides”, and “polynucleotides” and nucleic acid include RNA, DNA, or RNA/DNA hybrid sequences of more than one nucleotide in either single chain or duplex form. The terms encompass “modified nucleotides” which comprise at least one modification, including by way of example and not limitation: (a) an alternative linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or (d) an analogous sugar. For examples of analogous linking groups, purines, pyrimidines, and sugars see for example PCT publication No. WO 95/04064. The polynucleotide sequences of the invention may be prepared by any known method, including synthetic, recombinant, ex vivo generation, or a combination thereof, as well as utilizing any purification methods known in the art.

The terms polynucleotide construct, recombinant polynucleotide and recombinant polypeptide are used herein consistently with their use in the art. The terms “upstream” and “downstream” are also used herein consistently with their use in the art. The terms “base paired” and “Watson & Crick base paired” are used interchangeably herein and consistently with their use in the art. Similarly, the terms “complementary”, “complement thereof”, “complement”, “complementary polynucleotide”, “complementary nucleic acid” and “complementary nucleotide sequence” are used interchangeably herein and consistently with their use in the art.

The term “purified” is used herein to describe a polynucleotide or polynucleotide vector of the invention that has been separated from other compounds including, but not limited to, other nucleic acids, carbohydrates, lipids and proteins (such as the enzymes used in the synthesis of the polynucleotide). Purified can also refer to the separation of covalently closed polynucleotides from linear polynucleotides, or vice versa, for example. A polynucleotide is substantially pure when at least about 50%, 60%, 75%, or 90% of a sample contains a single polynucleotide sequence. In some cases this involves a determination between conformations (linear versus covalently closed). A substantially pure polynucleotide typically comprises about 50, 60, 70, 80, 90, 95, 99% weight/weight of a nucleic acid sample. Polynucleotide purity or homogeneity may be indicated by a number of means well known in the art, such as agarose or polyacrylamide gel electrophoresis of a sample, followed by visualizing a single polynucleotide band upon staining the gel. For certain purposes higher resolution can be provided by using HPLC or other means well known in the art.

Similarly, the term “purified” is used herein to describe a polypeptide of the invention that has been separated from other compounds including, but not limited to, nucleic acids, lipids, carbohydrates and other proteins. In some preferred embodiments, a polypeptide is substantially pure when at least about 50%, 60%, 75%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 99.5% of the polypeptide molecules of a sample have a single amino acid sequencesequence. In some preferred embodiments, a substantially pure polypeptide typically comprises about 50%, 60%, 70%, 80%, 90% 95%, 96%, 97%, 98%, 99% or 99.5% weight/weight of a protein sample. Polypeptide purity or homogeneity is indicated by a number of methods well known in the art, such as agarose or polyacrylamide gel electrophoresis of a sample, followed by visualizing a single polypeptide band upon staining the gel. For certain purposes higher resolution can be provided by using HPLC or other methods well known in the art.

Further, as used herein, the term “purified” does not require absolute purity; rather, it is intended as a relative definition. Purification of starting material or natural material to at least one order of magnitude, preferably two or three orders, and more preferably four or five orders of magnitude is expressly contemplated. Alternatively, purification may be expressed as “at least” a percent purity relative to heterologous polynucleotides (DNA, RNA or both) or polypeptides. As a preferred embodiment, the polynucleotides or polypeptides of the present invention are at least; 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96% 98%, 99%, 99.5% or 100% pure relative to heterologous polynucleotides or polypeptides. As a further preferred embodiment the polynucleotides or polypeptides have an “at least” purity ranging from any number, to the thousandth position, between 90% and 100% (e.g., at least 99.995% pure) relative to heterologous polynucleotides or polypeptides. Additionally, purity of the polynucleotides or polypeptides may be expressed as a percentage (as described above) relative to all materials and compounds other than the carrier solution. Each number, to the thousandth position, may be claimed as individual species of purity.

The term “isolated” requires that the material be removed from its original environment (e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the same polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials in the natural system, is isolated. Such polynucleotide could be part of a vector and/or such polynucleotide or polypeptide could be part of a composition, and still be isolated in that the vector or composition is not part of its natural environment.

Specifically excluded from the definition of “isolated” are: naturally occurring chromosomes (e.g., chromosome spreads), artificial chromosome libraries, genomic libraries, and cDNA libraries that exist either as an in vitro nucleic acid preparation or as a transfected/transformed host cell preparation, wherein the host cells are either an in vitro heterogeneous preparation or plated as a heterogeneous population of single colonies. Also specifically excluded are the above libraries wherein a 5′ EST makes up less than 5% (or alternatively 1%, 2%, 3%, 4%, 10%, 25%, 50%, 75%, or 90%, 95%, or 99%) of the number of nucleic acid inserts in the vector molecules. Further specifically excluded are whole cell genomic DNA or whole cell RNA preparations (including said whole cell preparations which are mechanically sheared or enzymatically digested). Further specifically excluded are the above whole cell preparations as either an in vitro preparation or as a heterogeneous mixture separated by electrophoresis (including blot transfers of the same) wherein the polynucleotide of the invention have not been further separated from the heterologous polynucleotides in the electrophoresis medium (e.g., further separating by excising a single band from a heterogeneous band population in an agarose gel or nylon blot).

The term “primer” denotes a specific oligonucleotide sequence which is complementary to a target nucleotide sequence and used to hybridize to the target nucleotide sequence. A primer serves as an initiation point for nucleotide polymerization catalyzed by DNA polymerase, RNA polymerase, or reverse transcriptase.

The term “probe” denotes a defined nucleic acid segment (or nucleotide analog segment, e.g., PNA as defined hereinbelow) which can be used to identify a specific polynucleotide sequence present in a sample, said nucleic acid segment comprising a nucleotide sequence complementary to the specific polynucleotide sequence to be identified.

The term “polypeptide” refers to a polymer of amino acids without regard to the length of the polymer. Thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This term also does not specify or exclude post-expression modifications of polypeptides. For example, polypeptides that include the covalent attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed by the term polypeptide. Also included within the definition are polypeptides which contain one or more analogs of an amino acid (including, for example, non-naturally occurring amino acids, amino acids which only occur naturally in an unrelated biological system, modified amino acids from mammalian systems etc.), polypeptides with substituted linkages, as well as other modifications known in the art, both naturally occurring and non-naturally occurring. As used herein, the term “OBG3” refers generically to the murine or human form of an OBG3 gene or gene product, unless otherwise specified. The terms “Acrp30” and “AdipoQ” refer specifically to the murine form of OBG3 and the term “apM1” refers specifically to the human form of OBG3.

Without being limited by theory, the compounds/polypeptides of the invention are capable of modulating the partitioning of dietary lipids between the liver and peripheral tissues, and are thus believed to treat “diseases involving the partitioning of dietary lipids between the liver and peripheral tissues.” The term “peripheral tissues” is meant to include muscle and adipose tissue. In preferred embodiments, the compounds/polypeptides of the invention partition the dietary lipids toward the muscle. In alternative preferred embodiments, the dietary lipids are partitioned toward the adipose tissue. In other preferred embodiments, the dietary lipids are partitioned toward the liver. In yet other preferred embodiments, the compounds/polypeptides of the invention increase or decrease the oxidation of dietary lipids, preferably free fatty acids (FFA) by the muscle. Dietary lipids include, but are not limited to triglycerides and free fatty acids.

Preferred diseases believed to involve the partitioning of dietary lipids include obesity and obesity-related diseases and disorders such as obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia.

The term “heterologous”, when used herein, is intended to designate any polypeptide or polynucleotide other than an OBG3 or OBG3 polypeptide or a polynucleotide encoding an OBG3 or gOBG3 polypeptide of the present invention.

The terms “comprising”, “consisting of” and “consisting essentially of” are defined according to their standard meaning. A defined meaning set forth in the M.P.E.P. controls over a defined meaning in the art and a defined meaning set forth in controlling Federal Circuit case law controls over a meaning set forth in the M.P.E.P. With this in mind, the terms may be substituted for one another throughout the instant application in order to attach the specific meaning associated with each term.

The term “host cell recombinant for” a particular polynucleotide of the present invention, means a host cell that has been altered by the hands of man to contain said polynucleotide in a way not naturally found in said cell. For example, said host cell may be transiently or stably transfected or transduced with said polynucleotide of the present invention.

The term “obesity” as used herein is defined in the WHO classifications of weight (Kopelman (2000) Nature 404:635643). Underweight is less than 18.5 (thin); Healthy is 18.5-24.9 (normal); grade 1 overweight is 25.0-29.9 (overweight); grade 2 overweight is 30.0-39.0 (obese); grade 3 overweight is greater than or equal to 40.0 BMI (morbid obesity). BMI is body mass index and is measured as kg/m². Waist circumference can also be used to indicate a risk of metabolic complications where in men a circumference of greater than or equal to 94 cm indicates an increased risk, and greater than or equal to 102 cm indicates a substantially increased risk. Similarly for women, greater than or equal to 88 cm indicates an increased risk, and greater than or equal to 88 cm indicates a substantially increased risk. The waist circumference is measured in cm at midpoint between lower border of ribs and upper border of the pelvis. Other measures of obesity include, but are not limited to, skinfold thickness which is a measurement in cm of skinfold thickness using calipers, and bioimpedance, which is based on the principle that lean mass conducts current better than fat mass because it is primarily an electrolyte solution; measurement of resistance to a weak current (impedance) applied across extremities provides an estimate of body fat using an empirically derived equation.

The term “agent acting on the partitioning of dietary lipids between the liver and peripheral tissues” refers to a compound or polypeptide of the invention that modulates the partitioning of dietary lipids between the liver and the peripheral tissues as previously described. Preferably, the agent increases or decreases the oxidation of dietary lipids, preferably free fatty acids (FFA) by the muscle. Preferably the agent decreases or increases the body weight of individuals or is used to treat or prevent an obesity-related disease or disorder such as obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia.

The terms “response to an agent acting on the partitioning of dietary lipids between the liver and peripheral tissues” refer to drug efficacy, including but not limited to, ability to metabolize a compound, ability to convert a pro-drug to an active drug, and the pharmacokinetics (absorption, distribution, elimination) and the pharmacodynamics (receptor-related) of a drug in an individual.

The terms “side effects to an agent acting on the partitioning of dietary lipids between the liver and peripheral tissues” refer to adverse effects of therapy resulting from extensions of the principal pharmacological action of the drug or to idiosyncratic adverse reactions resulting from an interaction of the drug with unique host factors. “Side effects to an agent acting on the partitioning of dietary lipids between the liver and peripheral tissues” can include, but are not limited to, adverse reactions such as dermatologic, hematologic or hepatologic toxicities and further includes gastric and intestinal ulceration, disturbance in platelet function, renal injury, nephritis, vasomotor rhinitis with profuse watery secretions, angioneurotic edema, generalized urticaria, and bronchial asthma to laryngeal edema and bronchoconstriction, hypotension, and shock.

The term “OBG3-related diseases and disorders” as used herein refers to any disease or disorder comprising an aberrant functioning of OBG3, or which could be treated or prevented by modulating OBG3 levels or activity. “Aberrant functioning of OBG3” includes, but is not limited to, aberrant levels of expression of OBG3 (either increased or decreased, but preferably decreased), aberrant activity of OBG3 (either increased or decreased), and aberrant interactions with ligands or binding partners (either increased or decreased). By “aberrant” is meant a change from the type, or level of activity seen in normal cells, tissues, or patients, or seen previously in the cell, tissue, or patient prior to the onset of the illness. In preferred embodiments, these OBG3-related diseases and disorders include obesity and the obesity-related diseases and disorders described previously.

The term “obesity-related” diseases and disorders as used herein refers to obesity-related diseases and disorders such as obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes), and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure.

The term “cosmetic treatments” is meant to include treatments with compounds or polypeptides of the invention that increase or decrease the body mass of an individual where the individual is not clinically obese or clinically thin. Thus, these individuals have a body mass index (BMI) below the cut-off for clinical obesity (e.g. below 25 kg/m²) and above the cut-off for clinical thinness (e.g. above 18.5 kg/m²). In addition, these individuals are preferably healthy (e.g. do not have an obesity-related disease or disorder of the invention). “Cosmetic treatments” are also meant to encompass, in some circumstances, more localized increases in adipose tissue, for example, gains or losses specifically around the waist or hips, or around the hips and thighs, for example. These localized gains or losses of adipose tissue can be identified by increases or decreases in waist or hip size, for example.

The term “preventing” as used herein refers to administering a compound prior to the onset of clinical symptoms of a disease or condition so as to prevent a physical manifestation of aberrations associated with obesity or OBG3. Alternatively, the term “preventing” can also be used to signify the reduction, or severity, of clinical symptoms associated with a disease or condition.

The term “treating” as used herein refers to administering a compound after the onset of clinical symptoms.

The term “in need of treatment” as used herein refers to a judgment made by a caregiver (e.g. physician, nurse, nurse practitioner, etc in the case of humans; veterinarian in the case of animals, including non-human mammals) that an individual or animal requires or will benefit from treatment. This judgment is made based on a variety of factors that are in the realm of a caregiver's expertise, but that include the knowledge that the individual or animal is ill, or will be ill, as the result of a condition that is treatable by the compounds of the invention.

The term “perceives a need for treatment” refers to a sub-clinical determination that an individual desires to reduce weight for cosmetic reasons as discussed under “cosmetic treatment” above. The term “perceives a need for treatment” in other embodiments can refer to the decision that an owner of an animal makes for cosmetic treatment of the animal.

The term “individual” or “patient” as used herein refers to any animal, including mammals, preferably mice, rats, other rodents, rabbits, dogs, cats, swine, cattle, sheep, horses, or primates, and most preferably humans. The term may specify male or female or both, or exclude male or female.

The term “non-human animal” refers to any non-human vertebrate, including birds and more usually mammals, preferably primates, animals such as swine, goats, sheep, donkeys, horses, cats, dogs, rabbits or rodents, more preferably rats or mice. Both the terms “animal” and “mammal” expressly embrace human subjects unless preceded with the term “non-human”.

The inventors have found that a fragment of OBG3, called gOBG3, is able to significantly reduce the postprandial response of plasma free fatty acids, glucose, and triglycerides in mice fed a high fat/sucrose meal. There was no significant effect on leptin, insulin or glucagon levels. In addition, gOBG3 was found to increase muscle free fatty acid oxidation in vitro and ex vivo. Further, gOBG3 was shown to decrease and then to prevent an increase in weight gain in mice that had been fed a high fat/sucrose diet for 19 days. In mice that had been maintained on the same high fat/sucrose diet for 6 months, gOBG3 treatment resulted in a sustained weight loss over 16 days that was significant, despite being maintained on the high fat/sucrose diet.

The instant invention encompasses the use of OBG3 polypeptide fragments in the partitioning of free fatty acid (FFA) and as an important new tool to control energy homeostasis. Of the tissues that can significantly remove lipids from circulation and cause FFA oxidation, muscle is quantitatively the most important. Globular OBG3 is a unique and novel pharmacological tool that controls body weight without interfering with food intake.

PREFERRED EMBODIMENTS OF THE INVENTION

I. OBG3 Polypeptide Fragments of the Invention

OBG3 polypeptide fragments that have measurable activity in vitro and in vivo have been identified. These activities include, but are not limited to, reduction of the postprandial response of plasma free fatty acids, glucose, and triglycerides in mice fed a high fat/sucrose meal (Example 8), increase in muscle free fatty acid oxidation in vitro and ex vivo (Example 12), and sustained weight loss in mice on a high fat/sucrose diet (Example 14). Other assays for OBG3 polypeptide fragment activity in vitro and in vivo are also provided (Examples 4, 7, 9, 11, 13, for example), and equivalent assays can be designed by those with skill in the art.

In contrast, the “intact” or “full-length” OBG3 polypeptide does not have either the in vivo or the in vitro activities that have been identified for OBG3 and gOBG3 polypeptide fragments of the invention. In most cases, the activities are either not present or at a minimum are undetectable over control values in the assays used. In other cases, the activities can be measured, but are present either at extremely reduced levels and/or require significantly more protein on a molar basis compared with the OBG3 and gOBG3 polypeptide fragments of the invention (see, e.g. Example 10). By “intact” or “full-length” OBG3 polypeptide as used herein is meant the full length polypeptide sequence of any OBG3 polypeptide, from the N-terminal methionine to the C-terminal stop codon. Examples of intact or fall length OBG3 polypeptides are found in SEQ ID NO:2 (mouse), SEQ ID NO:4 (mouse), and SEQ ID NO:6 (human). The term “OBG3 polypeptide fragments” as used herein refers to fragments of the “intact” or “full-length” OBG3 polypeptide that have “obesity-related activity”. The term “gOBG3 polypeptide fragments” refers to polypeptide fragments of the globular region only and is thus a narrower term than “OBG3 polypeptide fragments”. The term “fragment” means a polypeptide having a sequence that is entirely the same as part, but not all, of an intact or full-length OBG3 polypeptide. Such fragments may be “free-standing” (i.e. not part of or fused to other polypeptides), or one or more fragments may be present in a single polypeptide. OBG3 or gOBG3 fragments contiguous fragments of the full length OBG3 polypeptide unless otherwise specified.

The term “obesity-related activity” as used herein refers to at least one, and preferably all, of the activities described herein for OBG3 polypeptide fragments. Assays for the determination of these activities are provided herein (e.g. Examples 4, 7-9, 11-14), and equivalent assays can be designed by those with ordinary skill in the art. Optionally, “obesity-related activity” can be selected from the group consisting of lipid partitioning, lipid metabolism, and insulin-like activity, or an activity within one of these categories. By “lipid partitioning” activity is meant the ability to effect the location of dietary lipids among the major tissue groups including, adipose tissue, liver, and muscle. The inventors have shown that OBG3 polypeptide fragments of the invention play a role in the partitioning of lipids to the muscle, liver or adipose tissue. By “lipid metabolism” activity is meant the ability to influence the metabolism of lipids. The inventors have shown that OBG3 polypeptide fragments of the invention have the ability to affect the level of free fatty acids in the plasma as well as to increase the metabolism of lipids in the muscle through free fatty acid oxidation experiments (Examples 4, 8, 10, 11, 12) and to transiently affect the levels of triglycerides in the plasma and the muscle (Examples 8, 10-13). By “insulin-like” activity is meant the ability of OBG3 polypeptide fragments to modulate the levels of glucose in the plasma. The inventors have found that OBG3 polypeptide fragments do not significantly impact insulin levels but do impact glucose levels similarly to the effects of insulin (Examples 9 & 10). These effects are not seen in the presence of the intact (full-length) OBG3 polypeptide or are significantly greater in the presence of the OBG3 polypeptide fragments compared with the full-length OBG3 polypeptide.

The term “significantly greater” as used herein refers to a comparison of the activity of an OBG3 polypeptide fragment in an obesity-related assay compared with the activity of a full-length OBG3 polypeptide in the same assay. By “significantly” as used herein is meant statistically significant as it is typically determined by those with ordinary skill in the art. For example, data are typically calculated as a mean±SEM, and a p-value ≦0.05 is considered statistically significant. Statistical analysis is typically done using either the unpaired Student's t test or the paired Student's t test, as appropriate in each study. Examples of a significant change in activity as a result of the presence of an OBG3 polypeptide fragment of the invention compared to the presence of a full-length OBG3 polypeptide include an increase or a decrease in a given parameter of at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, or 75. One or more, but not necessarily all, of the measurable parameters will change significantly in the presence of OBG3 polypeptide fragments as compared to in the presence of an intact OBG3 polypeptide.

Representative “obesity-related assays” are provided in Examples 4, 7-9, and 11 -14. These assays include, but are not limited to, methods of measuring the postprandial response, methods of measuring free fatty acid oxidation, and methods of measuring weight modulation. In preferred embodiments, the post-prandial response is measured in non-human animals, preferably mice. In preferred embodiments changes in dietary lipids are measured, preferably free fatty acids and/or triglycerides. In other embodiments, other physiologic parameters are measured including, but not limited to, levels of glucose, insulin, and leptin. In other preferred embodiments, free fatty acid oxidation is measured in cells in vitro or ex vivo, preferably in muscle cells or tissue of non-human animals, preferably mice. In yet other preferred embodiments weight modulation is measured in human or non-human animals, preferably rodents (rats or mice), primates, canines, felines or procines on a high fat/sucrose diet. Optionally, “obesity-related activity” includes other activities not specifically identified herein. In general, “measurable parameters” relating to obesity and the field of metabolic research can be selected from the group consisting of free fatty acid levels, free fatty acid oxidation, triglyceride levels, glucose levels, insulin levels, leptin levels, food intake, weight, leptin and lipoprotein binding, uptake and degradation and LSR expression.

In these obesity-related assays, preferred OBG3 polypeptide fragments of the invention, but not full-length OBG3 polypeptides, would cause a significant change in at least one of the measurable parameters selected from the group consisting of post-prandial lipemia, free fatty acid levels, triglyceride levels, glucose levels, free fatty acid oxidation, and weight. Alternatively, preferred OBG3 polypeptide fragments of the invention, but not full-length OBG3 polypeptides, would have a significant change in at least one of the measurable parameters selected from the group consisting of an increase in LSR activity, an increase in leptin activity and an increase in lipoprotein activity. By “LSR” activity is meant expression of LSR on the surface of the cell, or in a particular conformation, as well as its ability to bind, uptake, and degrade leptin and lipoprotein. By “leptin” activity is meant its binding, uptake and degradation by LSR, as well as its transport across a blood brain barrier, and potentially these occurrences where LSR is not necessarily the mediating factor or the only mediating factor. Similarly, by “lipoprotein” activity is meant its binding, uptake and degradation by LSR, as well as these occurrences where LSR is not necessarily the mediating factor or the only mediating factor.

The invention is drawn, inter alia, to isolated, purified or recombinant OBG3 polypeptide fragments. OBG3 polypeptide fragments of the invention are useful for reducing or increasing (using antagonists of OBG3 polypeptides) body weight either as a cosmetic treatment or for treatment or prevention of obesity-related diseases and disorders. OBG3 polypeptide fragments are also useful inter alia in screening assays for agonists or antagonists of OBG3 fragment activity, for raising OBG3 fragment-specific antibodies, and in diagnostic assays. When used for cosmetic treatments, or for the treatment or prevention of obesity-related diseases, disorders, or conditions, one or more OBG3 polypeptide fragments can be provided to a subject. Thus, various fragments of the full-length protein can be combined into a “cocktail” for use in the various treatment regimens.

The full-length OBG3 polypeptide is comprised of at least four distinct regions including:

1. an N-terminal putative signal sequence from amino acids 1-17 of SEQ ID NO:6, SEQ ID NO:2, or SEQ ID NO:4;

2. a unique region from amino acids 18-41 of SEQ ID NO:6 or 18-44 of SEQ ID NO:2, or SEQ ID NO:4;

3. a collagen-like region from amino acids 42-107 of SEQ ID NO:6 or 45-110 of SEQ ID NO:2 or SEQ ID NO:4; and

4. a globular region from amino acids 108-244 of SEQ ID NO:6 or 111-247 of SEQ ID NO:2 or SEQ ID NO:4.

OBG3-related diseases and disorders include, but are not limited to any disease or disorder comprising an aberrant functioning of OBG3, or which could be treated or prevented by modulating OBG3 levels or activity. Aberrant functioning of OBG3 includes, but is not limited to, aberrant levels of expression of OBG3 (either increased or decreased, but preferably decreased), aberrant activity of OBG3 (either increased or decreased), and aberrant interactions with ligands or binding partners (either increased or decreased), where aberrant refers to a change from the type, or level of activity seen in normal cells, tissues, or patients, or seen previously in the cell, tissue, or patient prior to the onset of the illness. In preferred embodiments, these OBG3-related diseases and disorders include obesity and obesity-related diseases and disorders including, but not limited to insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes), and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure.

The term “collagen residues” is used in the manner standard in the art to mean the amino acid triplet glycine, X, Y, where X and Y can be any amino acid.

The OBG3 polypeptide fragments of the present invention are preferably provided in an isolated form, and may be partially or substantially purified. A recombinantly produced version of an OBG3 polypeptide fragment can be substantially purified by the one-step method described by Smith et al. ((1988) Gene 67(1):31-40) or by the methods described herein or known in the art (see, e.g., Examples 1-3). Fragments of the invention also can be purified from natural or recombinant sources using antibodies directed against the polypeptide fragments of the invention by methods known in the art of protein purification.

Preparations of OBG3 polypeptide fragments of the invention involving a partial purification of or selection for the OBG3 polypeptide fragments are also specifically contemplated. These crude preparations are envisioned to be the result of the concentration of cells expressing OBG3 polypeptide fragments with perhaps a few additional purification steps, but prior to complete purification of the fragment. The cells expressing OBG3 polypeptide fragments are present in a pellet, they are lysed, or the crude polypeptide is lyophilized, for example.

OBG3 or gOBG3 polypeptide fragments can be any integer in length from at least 6 consecutive amino acids to 1 amino acids less than a full length OBG3 polypeptide. Thus, for human OBG3 (SEQ ID NO: 6), an OBG3 or gOBG3 polypeptide fragment can be any integer of consecutive amino acids from 6 to 243; for mouse OBG3 (SEQ ID NO:2 or SEQ ID NO:4) an OBG3 or gOBG3 fragment can be any integer of consecutive amino acids from 6 to 246, for example. The term “integer” is used herein in its mathematical sense and thus representative integers include: 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, and 246.

Each OBG3 fragment as described above can be further specified in terms of its N-terminal and C-terminal positions. For example, every combination of a N-terminal and C-terminal position that fragments of from 6 contiguous amino acids to 1 amino acids less than the full length OBG3 polypeptide could occupy, on any given intact and contiguous full length OBG3 polypeptide sequence are included in the present invention. Thus, a 6 consecutive amino acid fragment could occupy positions selected from the group consisting of 1-6, 2-7, 3-8, 4-9, 5-10, 6-11, 7-12, 8-13, 9-14, 10-15, 11-16, 12-17, 13-18, 14-19, 15-20, 16-21, 17-22, 18-23, 19-24, 20-25, 21-26, 22-27, 23-28, 24-29, 25-30, 26-31, 27-32, 28-33, 29-34, 30-35, 31-36, 32-37, 33-38, 34-39, 35-40, 36-41, 37-42, 38-43, 39-44, 40-45, 41-46, 42-47, 43-48, 44-49, 45-50, 46-51, 47-52, 48-53, 49-54, 50-55, 51-56, 52-57, 53-58, 54-59, 55-60, 56-61, 57-62, 58-63, 59-64, 60-65, 61-66, 62-67, 63-68, 64-69, 65-70, 66-71, 67-72, 68-73, 69-74, 70-75, 71-76, 72-77, 73-78, 74-79, 75-80, 76-81, 77-82, 78-83, 79-84, 80-85, 81-86, 82-87, 83-88, 84-89, 85-90, 86-91, 87-92, 88-93, 89-94, 90-95, 91-96, 92-97, 93-98, 94-99, 95-100, 96-101, 97-102, 98-103, 99-104, 100-105, 101-106, 102-107, 103-108, 104-109, 105-110, 106-111, 107-112, 108-113, 109 -114, 110-115, 111-116, 112-117, 113-118, 114-119, 115-120, 116-121, 117-122, 118-123, 119-124, 120-125, 121-126, 122-127, 123-128, 124-129, 125-130, 126-131, 127-132, 128 -133, 129-134, 130-135, 131-136, 132-137, 133-138, 134-139, 135-140, 136-141, 137-142, 138-143, 139-144, 140-145, 141-146, 142-147, 143-148, 144-149, 145-150, 146-151, 147 -152, 148-153, 149-154, 150-155, 151-156, 152-157, 153-158, 154-159, 155-160, 156-161, 157-162, 158-163, 159-164, 160-165, 161-166, 162-167, 163-168, 164-169, 165-170, 166 -171, 167-172, 168-173, 169-174, 170-175, 171-176, 172-177, 173-178, 174-179, 175-180, 176-181, 177-182, 178-183, 179-184, 180-185, 181-186, 182-187, 183-188, 184-189, 185 -190, 186-191, 187-192, 188-193, 189-194, 190-195, 191-196, 192-197, 193-198, 194-199, 195-200, 196-201, 197-202, 198-203, 199-204, 200-205, 201-206, 202-207, 203-208, 204 -209, 205-210, 206-211, 207-212, 208-213, 209-214, 210-215, 211-216, 212-217, 213-218, 214-219, 215-220, 216-221, 217-222, 218-223, 219-224, 220-225, 221-226, 222-227, 223 -228, 224-229, 225-230, 226-231, 227-232, 228-233, 229-234, 230-235, 231-236, 232-237, 233-238, 234-239, 235-240, 236-241, 237-242, 238-243, and 239-244 of SEQ ID NO:6. A 238 consecutive amino acid fragment could occupy positions selected from the group consisting of 1-238, 2-239, 3-240, 4-241, 5-242, 6-243 and 7-244 of SEQ ID NO:6. Similarly, the positions occupied by all the other fragments of sizes between 6 amino acids and 243 amino acids on SEQ ID NO:6 are included in the present invention and can also be immediately envisaged based on these two examples and therefore, are not individually listed solely for the purpose of not unnecessarily lengthening the specification. Furthermore, the positions occupied by fragments of 6 to 241 consecutive amino acids on SEQ ID NO:2 or SEQ ID NO:4 are included in the present invention and can also be immediately envisaged based on these two examples and therefore are not individually listed solely for the purpose of not unnecessarily lengthening the specification. In addition, the positions occupied by fragments of 6 consecutive amino acids to 1 amino acid less than any other full length OBG3 polypeptide can also be envisaged based on these two examples and therefore are not individually listed solely for the purpose of not unnecessarily lengthening the specification.

The OBG3 or gOBG3 polypeptide fragments of the present invention may alternatively be described by the formula “n to c” (inclusive); where “n” equals the N-terminal most amino acid position (as defined by the sequence listing) and “c” equals the C-terminal most amino acid position (as defined by the sequence listing) of the polypeptide; and her where “n” equals an integer between 1 and the number of amino acids of the full length polypeptide sequence of the present invention minus 6 (238 for SEQ ID NO: 6 and 241 for SEQ ID NOs: 2 or 4); and where “c” equals an integer between 7 and the number of amino acids of the full length polypeptide sequence (244 for SEQ ID NO: 6 and 247 for SEQ ID NOs: 2 or 4); and where “n” is an integer smaller then “c” by at least 6. Therefore, for SEQ ID NO: 6, “n” is any integer selected from the list consisting of: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 234, 235, 236, 237 and 238; and “c” is any integer selected from the group consisting of: 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244. Every combination of “n” and “c” positions are included as specific embodiments of the invention. Moreover, the formula “n” to “c” may be modified as ‘“n1-n2” to “c1-c2”’, wherein “n1-n2” and “c1-c2” represent positional ranges selected from any two integers above which represent amino acid positions of the sequence listing. Alternative formulas include ‘“n1-n2” to “c”’ and ‘“n” to “c1-c2”’.

These specific embodiments, and other polypeptide or polynucleotide fragment embodiments described herein may be modified as being “at least”, “equal to”, “equal to or less than”, “less than”, “at least ______ but not greater than ______” or “from ______ to ______” a specified size or specified N-terminal and/or C-terminal positions, or 5′ and/or 3′ position, respectively. It is noted that all ranges used to describe any embodiment of the present invention are inclusive unless specifically set forth otherwise.

The present invention also provides for the exclusion of any individual fragment specified by N-terminal and C-terminal positions or of any fragment specified by size in amino acid residues as described above. In addition, any number of fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above may be excluded as individual species. Further, any number of fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above may make up a polypeptide fragment in any combination and may optionally include non-OBG3 polypeptide sequence as well.

In particularly preferred embodiments, the OBG3 polypeptide fragment is a “globular OBG3” (gOBG3) fragment. The term “gOBG3 fragment” or “gOBG3” or “gOBG3 polypeptide” as used herein refers to fragments of a full-length OBG3 polypeptide that comprise at least 6 and any other integer number of amino acids up to 137 of the globular region of a full-length OBG3 polypeptide (defined above). In preferred embodiments, gOBG3 polypeptide fragments also comprise at least 1 and any other integer number of amino acids up 66 of the collagen region of a full-length OBG3 polypeptide, preferably 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 consecutive amino acid residues from the collagen region of the intact OBG3 polypeptide that are adjacent to the globular region. By “adjacent” to the globular region is meant the first collagen amino acid immediately N-terminal to the globular region and adding each collagen amino acid consecutively in the N-terminal direction. Thus, for example, if there is only one collagen amino acid in the gOBG3 polypeptide fragment, it is the collagen amino acid 107 of SEQ ID NO: 6 or amino acid 110 of SEQ ID NO:2 or SEQ ID NO:4 located adjacent and 5′ to the first amino acid of the globular region. If there are 20 collagen amino acids adjacent to the globular region in the gOBG3 fragment they would be the collagen amino acids 88-107 of SEQ ID NO: 6 or amino acids 91-110 of SEQ ID NO:2 or SEQ ID NO:4.

In other preferred embodiments, gOBG3 polypeptide fragments are selected from amino acids 101 to 244, 108 to 244, or 132 to 244 of SEQ ID NO:6 and amino acids 104 to 247, 111 to 247, or 135 to 247 of SEQ ID NO:2 or SEQ ID NO:4. In yet other preferred embodiments, the invention features a gOBG3 polypeptide fragment comprising at least 115, but not more than 175 contiguous amino acids of any one of the gOBG3 fragment sequences set forth in FIG. 1, wherein no more than 12 of said at least 115 and no more than 175 contiguous amino acids are present in the collagen-like region of OBG3. Preferably, the gOBG3 polypeptide fragment comprises at least 125, but not more than 165, or at least 135, but not more than 155 amino acids, and no more than 9 amino acids are in the collagen-like region; more preferably at least 125 but not more than 165, or 135 but not more than 155 amino acids, and no more than 6 amino acids are in the collagen-like region; or at least 140 and not more than 150 amino acids, and no more than 3 amino acids are present in the collagen-like region. Preferably the gOBG3 fragment is mammalian, preferably human or mouse, but most preferably human.

OBG3 and gOBG3 polypeptide fragments of the invention include variants, fragments, analogs and derivatives of the OBG3 and gOBG3 polypeptide fragments described above, including modified OBG3 and gOBG3 polypeptide fragments.

Variants

It will be recognized by one of ordinary skill in the art that some amino acids of the OBG3 and gOBG3 fragment sequences of the present invention can be varied without significant effect on the structure or function of the protein; there will be critical amino acids in the fragment sequence that determine activity. Thus, the invention further includes variants of OBG3 and gOBG3 polypeptide fragments that have obesity-related activity as described above. Such variants include OBG3 fragment sequences with one or more amino acid deletions, insertions, inversions, repeats, and substitutions either from natural mutations or human manipulation selected according to general rules known in the art so as to have little effect on activity. Guidance concerning how to make phenotypically silent amino acid substitutions is provided below.

There are two main approaches for studying the tolerance of an amino acid sequence to change (see, Bowie et al. (1990) Science, 247, 1306-10). The first method relies on the process of evolution, in which mutations are either accepted or rejected by natural selection. The second approach uses genetic engineering to introduce amino acid changes at specific positions of a cloned gene and selections or screens to identify sequences that maintain functionality.

These studies have revealed that proteins are surprisingly tolerant of amino acid substitutions and indicate which amino acid changes are likely to be permissive at a certain position of the protein. For example, most buried amino acid residues require nonpolar side chains, whereas few features of surface side chains are generally conserved. Other such phenotypically silent substitutions are described by Bowie et al. 1990 (supra) and the references cited therein.

Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and Phe; interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu; substitution between the amide residues Asn and Gln; exchange of the basic residues Lys and Arg; and replacements among the aromatic residues Phe, Tyr. In addition, the following groups of amino acids generally represent equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr; (2) Cys, Ser, Tyr, Thr; (3) Val, Ile, Leu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, His.

Similarly, amino acids in the OBG3 and gOBG3 polypeptide fragment sequences of the invention that are essential for function can also be identified by methods known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (see, e.g., Cunningham, et al. (1989) Science 244(4908):1081-5). The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for obesity-related activity using assays as described above. Of special interest are substitutions of charged amino acids with other charged or neutral amino acids that may produce proteins with highly desirable improved characteristics, such as less aggregation. Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical or physiologically acceptable formulations, because aggregates can be immunogenic (see, e.g., Pinckard, et al., (1967) Clin. Exp. Immunol 2:331-340; Robbins, et al., (1987) Diabetes July;36(7):838-41; and Cleland, et al., (1993) Crit Rev Ther Drug Carrier Syst. 10(4):307-77).

Thus, the fragment, derivative, analog, or homolog of the OBG3 or gOBG3 fragment of the present invention may be, for example: (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code (i.e. may be a non-naturally occurring amino acid); or (ii) one in which one or more of the amino acid residues includes a substituent group; or (iii) one in which the OBG3 or gOBG3 fragment is fused with another compound, such as a compound to increase the half-life of the fragment (for example, polyethylene glycol); or (iv) one in which the additional amino acids are fused to the above form of the fragment, such as an IgG Fc fusion region peptide or leader or secretory sequence or a sequence which is employed for purification of the above form of the fragment or a pro-protein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those skilled in the art from the teachings herein.

A further embodiment of the invention relates to a polypeptide which comprises the amino acid sequence of an OBG3 or gOBG3 polypeptide fragment having an amino acid sequence which contains at least one conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid substitutions, and not more than 20 conservative amino acid substitutions. Also provided are polypeptides which comprise the amino acid sequence of a OBG3 or gOBG3 fragment, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions.

Another specific embodiment of a modified OBG3 or gOBG3 fragment of the invention is a polypeptide that is resistant to proteolysis, for example a OBG3 or gOBG3 fragment in which a —CONH— peptide bond is modified and replaced by one or more of the following: a (CH2NH) reduced bond; a (NHCO) retro inverso bond; a (CH2—O) methylene-oxy bond; a (CH2-S) thiomethylene bond; a (CH2CH2) carba bond; a (CO—CH2) cetomethylene bond; a (CHOH—CH2) hydroxyethylene bond); a (N—N) bound; a E-alcene bond; or a —CH═CH— bond. Thus, the invention also encompasses an OBG3 or gOBG3 fragment or a variant thereof in which at least one peptide bond has been modified as described above.

In addition, amino acids have chirality within the body of either L or D. In some embodiments it is preferable to alter the chirality of the amino acids in the OBG3 or gOBG3 polypeptide fragments of the invention in order to extend half-life within the body. Thus, in some embodiments, one or more of the amino acids are preferably in the L configuration. In other embodiments, one or more of the amino acids are preferably in the D configuration.

Percent Identity

The polypeptides of the present invention also include polypeptides having an amino acid sequence at least 50% identical, at least 60% identical, or 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to an OBG3 or gOBG3 fragment as described above. By a polypeptide having an amino acid sequence at least, for example, 95% “identical” to an OBG3 or gOBG3 fragment amino acid sequence is meant that the amino acid sequence is identical to the OBG3 or gOBG3 polypeptide fragment sequence except that it may include up to five amino acid alterations per each 100 amino acids of the OBG3 or gOBG3 polypeptide fragment amino acid sequence. The reference sequence is the OBG3 or gOBG3 polypeptide fragment with a sequence corresponding to the sequence of the sequence listing. Thus, to obtain a polypeptide having an amino acid sequence at least 95% identical to an OBG3 or gOBG3 fragment amino acid sequence, up to 5% (5 of 100) of the amino acid residues in the sequence may be inserted, deleted, or substituted with another amino acid compared with the OBG3 or gOBG3 polypeptide fragment sequence. These alterations may occur at the amino or carboxy termini or anywhere between those terminal positions, interspersed either individually among residues in the sequence or in one or more contiguous groups within the sequence.

As a practical matter, whether any particular polypeptide is a percentage identical to an OBG3 or gOBG3 fragment can be determined conventionally using known computer programs. Such algorithms and programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman, (1988) Proc Natl Acad Sci USA April;85(8):2444-8; Altschul et al., (1990) J. Mol. Biol. 215(3):403-410; Thompson et al., (1994) Nucleic Acids Res. 22(2):4673-4680; Higgins et al., (1996) Meth. Enzymol. 266:383-402; Altschul et al., (1997) Nuc. Acids Res. 25:3389 -3402; Altschul et al., (1993) Nature Genetics 3:266-272). In a particularly preferred embodiment, protein and nucleic acid sequence homologies are evaluated using the Basic Local Alignment Search Tool (“BLAST”), which is well known in the art (See, e.g., Karlin and Altschul (1990) Proc Natl Acad Sci USA March;87(6):2264-8; Altschul et al., 1990, 1993, 1997, all supra). In particular, five specific BLAST programs are used to perform the following tasks:

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein sequence database;

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence database;

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide sequence (both strands) against a protein sequence database;

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database translated in all six reading frames (both strands); and

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against the six-frame translations of a nucleotide sequence database.

The BLAST programs identify homologous sequences by identifying similar segments, which are referred to herein as “high-scoring segment pairs,” between a query amino or nucleic acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence database. High-scoring segment pairs are preferably identified (i.e., aligned) by means of a scoring matrix, many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 matrix (see, Gonnet et al., (1992) Science June 5;256(5062):1443-5; Henikoff and Henikoff (1993) Proteins September;17(1):49 -61). Less preferably, the PAM or PAM250 matrices may also be used (See, e.g., Schwartz and Dayhoff, eds, (1978) Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and Structure, Washington: National Biomedical Research Foundation). The BLAST programs evaluate the statistical significance of all high-scoring segment pairs identified, and preferably selects those segments which satisfy a user-specified threshold of significance, such as a user-specified percent homology. Preferably, the statistical significance of a high-scoring segment pair is evaluated using the statistical significance formula of Karlin (See, e.g., Karlin and Altschul, (1990) Proc Natl Acad Sci USA March;87(6):2264-8). The BLAST programs may be used with the default parameters or with modified parameters provided by the user. Preferably, the parameters are default parameters.

A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences are both amino acid sequences. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group=25 Length=0, Cutoff Score=1, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=247 or the length of the subject amino acid sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or C-terminal deletions, not because of internal deletions, the results, in percent identity, must be manually corrected because the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminal of the subject sequence, that are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the purposes of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the purposes of manually adjusting the percent identity score. That is, only query amino acid residues outside the farthest N- and C-terminal residues of the subject sequence.

For example, a 90 amino acid residue subject sequence is aligned with a 100-residue query sequence to determine percent identity. The deletion occurs at the N-terminus of the subject sequence and therefore, the FASTDB alignment does not match/align with the first residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C-termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched the final percent identity would be 90%.

In another example, a 90-residue subject sequence is compared with a 100-residue query sequence. This time the deletions are internal so there are no residues at the N- or C-termini of the subject sequence, which are not matched/aligned with the query. In this case, the percent identity calculated by FASTDB is not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the query sequence are manually corrected. No other manual corrections are made for the purposes of the present invention.

Production

Note, throughout the disclosure, wherever OBG3 polypeptide fragments are discussed, gOBG3fragments are specifically intended to be included as a preferred subset of OBG3 polypeptide fragments.

OBG3 polypeptide fragments are preferably isolated from human or mammalian tissue samples or expressed from human or mammalian genes in human or mammalian cells. The OBG3 polypeptide fragments of the invention can be made using routine expression methods known in the art. The polynucleotide encoding the desired polypeptide fragments is ligated into an expression vector suitable for any convenient host. Both eukaryotic and prokaryotic host systems are used in forming recombinant polypeptide fragments. The polypeptide fragment is then isolated from lysed cells or from the culture medium and purified to the extent needed for its intended use. Purification is by any technique known in the art, for example, differential extraction, salt fractionation, chromatography, centrifugation, and the like. See, for example, Methods in Enzymology for a variety of methods for purifying proteins. Also, see Examples 1-3 for methods previously used for OBG3 polypeptide fragments.

In a alternative embodiment, the polypeptides of the invention are isolated from milk. The polypeptides can be purified as full length OBG3 polypeptides, which can then be cleaved, if appropriate, in vitro to generate an OBG3 fragment, or, alternatively, OBG3 fragments themselves can be purified from the milk. Any of a large number of methods can be used to purify the present polypeptides from milk, including those taught in Protein Purification Applications, A Practical Approach (New Edition), Edited by Simon Roe, AEA Technology Products and Systems, Biosciences, Harwell; Clark (1998) J Mammary Gland Biol Neoplasia 3:337-50; Wilkins and Velander (1992) 49:333-8; U.S. Pat. Nos. 6,140,552; 6,025,540; Hennighausen, Protein Expression and Purification, vol. 1, pp. 3-8 (1990); Harris et al. (1997) Bioseparation 7:31-7; Degener et al. (1998) J. Chromatog. 799:125-37; Wilkins (1993) J. Cell. Biochem. Suppl. 0 (17 part A):39; the entire disclosures of each of which are herein incorporated by reference. In a typical embodiment, milk is centrifuged, e.g. at a relatively low speed, to separate the lipid fraction, and the aqueous supernatant is then centrifuged at a higher speed to separate the casein in the milk from the remaining, “whey” fraction. Often, biomedical proteins are found in this whey fraction, and can be isolated from this fraction using standard chromatographic or other procedures commonly used for protein purification, e.g. as described elsewhere in the present application. In one preferred embodiment, OBG3 polypeptides are purified using antibodies specific to OBG3 polypeptides, e.g. using affinity chromatography. In addition, methods can be used to isolate particular OBG3 fragments, e.g. electrophoretic or other methods for isolating proteins of a particular size. The OBG3 polypeptides isolating using these methods can be naturally occurring, as OBG3 polypeptides have been discovered to be naturally present in the milk of mammals (see, e.g. Example 17), or can be the result of the recombinant production of the protein in the mammary glands of a non-human mammal, as described infra. In one such embodiment, the OBG3 fragment is produced as a fusion protein with a heterologous, antigenic polypeptide sequence, which antigenic sequence can be used to purify the protein, e.g., using standard immuno-affinity methodology.

In addition, shorter protein fragments may be produced by chemical synthesis. Alternatively, the proteins of the invention are extracted from cells or tissues of humans or non-human animals. Methods for purifying proteins are known in the art, and include the use of detergents or chaotropic agents to disrupt particles followed by differential extraction and separation of the polypeptides by ion exchange chromatography, affinity chromatography, sedimentation according to density, and gel electrophoresis.

Any OBG3 fragment cDNA, including that in FIG. 4, can be used to express OBG3 polypeptide fragments. The nucleic acid encoding the OBG3 fragment to be expressed is operably linked to a promoter in an expression vector using conventional cloning technology. The OBG3 fragment cDNA insert in the expression vector may comprise the coding sequence for: the full length OBG3 polypeptide (to be later modified); from 6 amino acids to 6 amino acids less than the full-length OBG3 polypeptide; a gOBG3 fragment; or variants and % similar polypeptides.

The expression vector is any of the mammalian, yeast, insect or bacterial expression systems known in the art, some of which are described herein, and examples of which are given in the Examples (Examples 1-3). Commercially available vectors and expression systems are available from a variety of suppliers including Genetics Institute (Cambridge, Mass.), Stratagene (La Jolla, Calif.), Promega (Madison, Wis.), and Invitrogen (San Diego, Calif.). If desired, to enhance expression and facilitate proper protein folding, the codon context and codon pairing of the sequence can be optimized for the particular expression organism into which the expression vector is introduced, as explained by Hatfield, et al., U.S. Pat. No. 5,082,767, the disclosures of which are incorporated by reference herein in their entirety.

If the nucleic acid encoding OBG3 polypeptide fragments lacks a methionine to serve as the initiation site, an initiating methionine can be introduced next to the first codon of the nucleic acid using conventional techniques. Similarly, if the insert from the OBG3 polypeptide fragment cDNA lacks a poly A signal, this sequence can be added to the construct by, for example, splicing out the Poly A signal from pSG5 (Stratagene) using BglI and SalI restriction endonuclease enzymes and incorporating it into the mammalian expression vector pXT1 (Stratagene). pXT1 contains the LTRs and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in the construct allow efficient stable transfection. The vector includes the Herpes Simplex Thymidine Kinase promoter and the selectable neomycin gene.

The nucleic acid encoding an OBG3 fragment can be obtained by PCR from a vector containing the OBG3 nucleotide sequence using oligonucleotide primers complementary to the desired OBG3 cDNA and containing restriction endonuclease sequences for Pst I incorporated into the 5′ primer and BglII at the 5′ end of the corresponding cDNA 3′ primer, taking care to ensure that the sequence encoding the OBG3 fragment is positioned properly with respect to the poly A signal. The purified fragment obtained from the resulting PCR reaction is digested with PstI, blunt ended with an exonuclease, digested with Bgl II, purified and ligated to pXT1, now containing a poly A signal and digested with BglII. Alternative methods are presented in Examples 1-3.

Transfection of an OBG3 fragment-expressing vector into mouse NIH 3T3 cells is one embodiment of introducing polynucleotides into host cells. Introduction of a polynucleotide encoding a polypeptide into a host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, or other methods. Such methods are described in many standard laboratory manuals, such as Davis et al. ((1986) Methods in Molecular Biology, Elsevier Science Publishing Co., Inc., Amsterdam). It is specifically contemplated that the polypeptides of the present invention may in fact be expressed by a host cell lacking a recombinant vector. Methods of expressing OBG3 fragment of the invention in cells are described in Examples 1-3.

A polypeptide of this invention (i.e. an OBG3 or gOBG3 fragment) can be recovered and purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography and lectin chromatography. Most preferably, high performance liquid chromatography (“HPLC”) is employed for purification. Polypeptides of the present invention, and preferably the secreted form, can also be recovered from: products purified from natural sources, including bodily fluids, tissues and cells, whether directly isolated or cultured; products of chemical synthetic procedures; and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and mammalian cells.

Depending upon the host employed in a recombinant production procedure, the polypeptides of the present invention may be glycosylated or may be non-glycosylated. Preferably the polypeptides of the invention are non-glycosylated. In addition, polypeptides of the invention may also include an initial modified methionine residue, in some cases as a result of host-mediated processes. Thus, it is well known in the art that the N-terminal methionine encoded by the translation initiation codon generally is removed with high efficiency from any protein after translation in all eukaryotic cells. While the N-terminal methionine on most proteins also is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal process is inefficient, depending on the nature of the amino acid to which the N-terminal methionine is covalently linked.

In addition to encompassing host cells containing the vector constructs discussed herein, the invention also encompasses primary, secondary, and immortalized host cells of vertebrate origin, particularly mammalian origin, that have been engineered to delete or replace endogenous genetic material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous polynucleotide sequences) that is operably associated with the polynucleotides of the invention, and which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques known in the art may be used to operably associate heterologous control regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences via homologous recombination, see, e.g., U.S. Pat. No. 5,641,670, issued Jun. 24, 1997; International Publication No. WO 96/29411, published Sep. 26, 1996; International Publication No. WO 94/12650, published Aug. 4, 1994; Koller et al., (1989) Proc Natl Acad Sci USA November;86(22):8932-5; Koller et al., (1989) Proc Natl Acad Sci USA November;86(22):8927-31; and Zijlstra et al. (1989) Nature November 23;342(6248):435-8; the disclosures of each of which are incorporated by reference in their entireties).

Modifications

In addition, polypeptides of the invention can be chemically synthesized using techniques known in the art (See, e.g., Creighton, 1983 Proteins. New York, N.Y.: W. H. Freeman and Company; and Hunkapiller et al., (1984) Nature July 12 -18;310(5973):105-11). For example, a relative short fragment of the invention can be synthesized by use of a peptide synthesizer. Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be introduced as a substitution or addition into the fragment sequence. Non-classical amino acids include, but are not limited to, to the D-isomers of the common amino acids, 2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary).

The invention encompasses polypeptide fragments which are differentially modified during or after translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of numerous chemical modifications may be carried out by known techniques, including but not limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of tunicamycin; etc.

Additional post-translational modifications encompassed by the invention include, for example, N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate chains, and addition or deletion of an N-terminal methionine residue as a result of procaryotic host cell expression. The polypeptide fragments may also be modified with a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the polypeptide.

Also provided by the invention are chemically modified derivatives of the polypeptides of the invention that may provide additional advantages such as increased solubility, stability and circulating time of the polypeptide, or decreased immunogenicity. See U.S. Pat. No: 4,179,337. The chemical moieties for derivitization may be selected from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides may be modified at random positions within the molecule, or at predetermined positions within the molecule and may include one, two, three or more attached chemical moieties.

The polymer may be of any molecular weight, and may be branched or unbranched. For polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the term “about” indicating that in preparations of polyethylene glycol, some molecules will weigh more, some less, than the stated molecular weight) for ease in handling and manufacturing. Other sizes may be used, depending on the desired therapeutic profile (e.g., the duration of sustained release desired, the effects, if any on biological activity, the ease in handling, the degree or lack of antigenicity and other known effects of the polyethylene glycol to a therapeutic protein or analog).

The polyethylene glycol molecules (or other chemical moieties) should be attached to the polypeptide with consideration of effects on functional or antigenic domains of the polypeptide. There are a number of attachment methods available to those skilled in the art, e.g., EP 0 401 384, herein incorporated by reference (coupling PEG to G-CSF), see also Malik et al. (1992) Exp Hematol. September;20(8):1028-35, reporting pegylation of GM-CSF using tresyl chloride). For example, polyethylene glycol may be covalently bound through amino acid residues via a reactive group, such as, a free amino or carboxyl group. Reactive groups are those to which an activated polyethylene glycol molecule may be bound. The amino acid residues having a free amino group may include lysine residues and the N-terminal amino acid residues; those having a free carboxyl group may include aspartic acid residues, glutamic acid residues and the C-terminal amino acid residue. Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such as attachment at the N-terminus or lysine group.

One may specifically desire proteins chemically modified at the N-terminus. Using polyethylene glycol as an illustration of the present composition, one may select from a variety of polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of pegylation reaction to be performed, and the method of obtaining the selected N-terminally pegylated protein. The method of obtaining the N-terminally pegylated preparation (i.e., separating this moiety from other monopegylated moieties if necessary) may be by purification of the N-terminally pegylated material from a population of pegylated protein molecules. Selective proteins chemically modified at the N-terminus may be accomplished by reductive alkylation, which exploits differential reactivity of different types of primary amino groups (lysine versus the N-terminal) available for derivatization in a particular protein. Under the appropriate reaction conditions, substantially selective derivatization of the protein at the N-terminus with a carbonyl group containing polymer is achieved.

Multimers

The polypeptide fragments of the invention may be in monomers or multimers (i.e., dimers, trimers, tetramers and higher multimers). Accordingly, the present invention relates to monomers and multimers of the polypeptide fragments of the invention, their preparation, and compositions (preferably, pharmaceutical or physiologically acceptable compositions) containing them. In specific embodiments, the polypeptides of the invention are monomers, dimers, trimers or tetramers. In additional embodiments, the multimers of the invention are at least dimers, at least trimers, or at least tetramers.

Multimers encompassed by the invention may be homomers or heteromers. As used herein, the term homomer, refers to a multimer containing only polypeptides corresponding to the OBG3 polypeptide fragments of the invention (including polypeptide fragments, variants, splice variants, and fusion proteins corresponding to these polypeptide fragments as described herein). These homomers may contain polypeptide fragments having identical or different amino acid sequences. In a specific embodiment, a homomer of the invention is a multimer containing only polypeptide fragments having an identical amino acid sequence. In another specific embodiment, a homomer of the invention is a multimer containing polypeptide fragments having different amino acid sequences. In specific embodiments, the multimer of the invention is a homodimer (e.g. containing polypeptide fragments having identical or different amino acid sequences) or a homotrimer (e.g., containing polypeptide fragments having identical and/or different amino acid sequences). In additional embodiments, the homomeric multimer of the invention is at least a homodimer, at least a homotrimer, or at least a homotetramer.

As used herein, the term heteromer refers to a multimer containing one or more heterologous polypeptides (i.e., corresponding to different proteins or polypeptide fragments thereof) in addition to the polypeptides of the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer.

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or covalent associations and/or may be indirectly linked, by for example, liposome formation. Thus, in one embodiment, multimers of the invention, such as, for example, homodimers or homotrimers, are formed when polypeptides of the invention contact one another in solution. In another embodiment, heteromultimers of the invention, such as, for example, heterotrimers or heterotetramers, are formed when polypeptides of the invention contact antibodies to the polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a fusion protein of the invention) in solution. In other embodiments, multimers of the invention are formed by covalent associations with and/or between the polypeptides of the invention. Such covalent associations may involve one or more amino acid residues contained in the polypeptide sequence (e.g., that recited in the sequence listing, or contained in the polypeptide encoded by a deposited clone). In one instance, the covalent associations are cross-linking between cysteine residues located within the polypeptide sequences, which interact in the native (i.e., naturally occurring) polypeptide. In another instance, the covalent associations are the consequence of chemical or recombinant manipulation. Alternatively, such covalent associations may involve one or more amino acid residues contained in the heterologous polypeptide sequence in a fusion protein of the invention.

In one example, covalent associations are between the heterologous sequence contained in a fusion protein of the invention (see, e.g., U.S. Pat. No. 5,478,925). In a specific example, the covalent associations are between the heterologous sequence contained in an Fc fusion protein of the invention (as described herein). In another specific example, covalent associations of fusion proteins of the invention are between heterologous polypeptide sequence from another protein that is capable of forming covalently associated multimers, such as for example, oseteoprotegerin (see, e.g., International Publication NO: WO 98/49305, the contents of which are herein incorporated by reference in its entirety). In another embodiment, two or more polypeptides of the invention are joined through peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 5,073,627 (hereby incorporated by reference). Proteins comprising multiple polypeptides of the invention separated by peptide linkers may be produced using conventional recombinant DNA technology.

Another method for preparing multimer polypeptides of the invention involves use of polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence. Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of the proteins in which they are found. Leucine zippers were originally identified in several DNA-binding proteins, and have since been found in a variety of different proteins (Landschulz et al., (1988) Genes Dev. July;2(7):786-800). Among the known leucine zippers are naturally occurring peptides and derivatives thereof that dimerize or trimerize. Examples of leucine zipper domains suitable for producing soluble multimeric proteins of the invention are those described in PCT application WO 94/10308, hereby incorporated by reference. Recombinant fusion proteins comprising a polypeptide of the invention fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in suitable host cells, and the resulting soluble multimeric fusion protein is recovered from the culture supernatant using techniques known in the art.

Trimeric polypeptides of the invention may offer the advantage of enhanced biological activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially form trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as described in Hoppe et al. FEBS Letters (1994) May 16;344(2-3):191 -5. and in U.S. patent application Ser. No. 08/446,922, hereby incorporated by reference. Other peptides derived from naturally occurring trimeric proteins may be employed in preparing trimeric polypeptides of the invention. In another example, proteins of the invention are associated by interactions between Flag® & polypeptide sequence contained in fusion proteins of the invention containing Flag® polypeptide sequence. In a further embodiment, proteins of the invention are associated by interactions between heterologous polypeptide sequence contained in Flag® fusion proteins of the invention and anti Flag® antibody.

The multimers of the invention may be generated using chemical techniques known in the art. For example, polypeptides desired to be contained in the multimers of the invention may be chemically cross-linked using linker molecules and linker molecule length optimization techniques known in the art (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety). Additionally, multimers of the invention may be generated using techniques known in the art to form one or more inter-molecule cross-links between the cysteine residues located within the sequence of the polypeptides desired to be contained in the multimer (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides of the invention may be routinely modified by the addition of cysteine or biotin to the C-terminus or N-terminus of the polypeptide and techniques known in the art may be applied to generate multimers containing one or more of these modified polypeptides (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety). Additionally, at least 30 techniques known in the art may be applied to generate liposomes containing the polypeptide components desired to be contained in the multimer of the invention (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety).

Alternatively, multimers of the invention may be generated using genetic engineering techniques known in the art. In one embodiment, polypeptides contained in multimers of the invention are produced recombinantly using fusion protein technology described herein or otherwise known in the art (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety). In a specific embodiment, polynucleotides coding for a homodimer of the invention are generated by ligating a polynucleotide sequence encoding a polypeptide of the invention to a sequence encoding a linker polypeptide and then further to a synthetic polynucleotide encoding the translated product of the polypeptide in the reverse orientation from the original C-terminus to the N-terminus (lacking the leader sequence) (see, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety). In another embodiment, recombinant techniques described herein or otherwise known in the art are applied to generate recombinant polypeptides of the invention which contain a transmembrane domain (or hyrophobic or signal peptide) and which can be incorporated by membrane reconstitution techniques into liposomes (See, e.g., U.S. Pat. No. 5,478,925, which is herein incorporated by reference in its entirety).

II. OBG3 Polynucleotides of the Invention

Preferred polynucleotides are those that encode OBG3 and gOBG3 polypeptide fragments of the invention. The recombinant polynucleotides encoding OBG3 and gOBG3 polypeptide fragments can be used in a variety of ways, including, but not limited to, expressing the polypeptide in recombinant cells for use in screening assays for antagonists and agonists of its activity as well as to facilitate its purification for use in a variety of ways including, but not limited to screening assays for agonists and antagonists of its activity, diagnostic screens, and raising antibodies, as well as treatment and/or prevention of obesity-related diseases and disorders and/or to reduce body mass.

The invention relates to the polynucleotides encoding OBG3 and gOBG3 polypeptide fragments and variant polypeptide fragments thereof as described herein. These polynucleotides may be purified, isolated, and/or recombinant. In all cases, the desired OBG3 and gOBG3 polynucleotides of the invention are those that encode OBG3 and gOBG3 polypeptide fragments of the invention have obesity-related activity as described and discussed herein.

Fragments

A polynucleotide fragment is a portion of a polynucleotide of the sequence listing or of a sequence that encodes a specified OBG3 polypeptide of the invention. Further, a polynucleotide fragment is a polynucleotide having a sequence that is the same as a portion, but not all, of a full length or specified OBG3 or gOBG3 polynucleotide sequence. Such fragments may be “free-standing”, i.e. not part of or fused to other polynucleotides, or they may be comprised within another non-OBG3 or non-gOBG3 (heterologous) polynucleotide of which they form a part or region. However, several OBG3 or gOBG3 polynucleotide fragments may be comprised within a single polynucleotide.

The OBG3 polynucleotides of the invention comprise from 18 consecutive bases to 18 consecutive bases less than the full length polynucleotide sequence encoding the full length OBG3 polypeptide, for example the full length OBG3 polynucleotide sequences in SEQ ID NO: 1, SEQ ID NO:3, or SEQ ID NO:5. In one aspect of this embodiment, the polynucleotide comprises at least 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 405, 410, 415, 420, 425, 430, 435, 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, or 740 consecutive nucleotides of a polynucleotide of the present invention.

In addition to the above preferred nucleic acid sizes, further preferred nucleic acids comprise at least 18 nucleotides, wherein “at least 18” is defined as any integer between 18 and the integer representing 18 nucleotides less than the 3′ most nucleotide position of the intact OBG3 polynucleotide as set forth in the sequence listing (SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5) or elsewhere herein.

Further included as preferred polynucleotides of the present invention are nucleic acid fragments at least 18 nucleotides in length, as described above, that are further specified in terms of their 5′ and 3′ position. The 5′ and 3′ positions are represented by the position numbers set forth in the sequence listing. For allelic and degenerate and other variants, position 1 is defined as the 5′ most nucleotide of the ORF, i.e., the nucleotide “A” of the start codon (ATG) with the remaining nucleotides numbered consecutively. Therefore, every combination of a 5′ and 3′ nucleotide position that a polynucleotide fragment of the invention, at least 18 contiguous nucleotides in length, could occupy on an intact OBG3 polynucleotide of the invention is included in the invention as an individual species. The polynucleotide fragments specified by 5′ and 3′ positions can be immediately envisaged and are therefore not individually listed solely for the purpose of not unnecessarily lengthening the specification.

It is noted that the above species of polynucleotide fragments of the present invention may alternatively be described by the formula “x to y”; where “x” equals the 5′ most nucleotide position and “y” equals the 3′ most nucleotide position of the polynucleotide; and further where “x” equals an integer between 1 and the number of nucleotides of the polynucleotide sequence of the present invention minus 18, and where “y” equals an integer between 19 and the number of nucleotides of the polynucleotide sequence of the present invention minus 18 nucleotides; and where “x” is an integer smaller then “y” by at least 18.

The present invention also provides for the exclusion of any species of polynucleotide fragments of the present invention specified by 5′ and 3′ positions or polynucleotides specified by size in nucleotides as described above. Any number of fragments specified by 5′ and 3′ positions or by size in nucleotides, as described above, may be excluded.

The gOBG3 polynucleotide fragments of the invention comprise from 18 consecutive bases to the full length polynucleotide sequence encoding the gOBG3 fragments described in Section II of the Preferred Embodiments of the Invention. In one aspect of this embodiment, the polynucleotide comprises at least 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 405, 410, 415, 420, 425, 430, 435, 440, 445, 450, 455, 460, or 465 consecutive nucleotides of a polynucleotide of the present invention.

In addition to the above preferred nucleic acid sizes, further preferred nucleic acids comprise at least 18 nucleotides, wherein “at least 18” is defined as any integer between 18 and the integer corresponding to the 3′ most nucleotide position of a gOBG3 fragment cDNA herein.

Further included as preferred polynucleotides of the present invention are nucleic acid fragments at least 18 nucleotides in length, as described above, that are further specified in terms of their 5′ and 3′ position. The 5′ and 3′ positions are represented by the position numbers set forth in the sequence listing below. For allelic and degenerate and other variants, position 1 is defined as the 5′ most nucleotide of the open reading frame (ORF), i.e., the nucleotide “A” of the start codon (ATG) with the remaining nucleotides numbered consecutively. Therefore, every combination of a 5′ and 3′ nucleotide position that a polynucleotide fragment invention, at least 18 contiguous nucleotides in length, could occupy on a gOBG3 fragment polynucleotide of the present invention is included in the invention as an individual species. The polynucleotide fragments specified by 5′ and 3′ positions can be immediately envisaged and are therefore not individually listed solely for the purpose of not unnecessarily lengthening the specification.

It is noted that the above species of polynucleotide fragments of the present invention may alternatively be described by the formula “x to y”,; where “x” equals the 5′ most nucleotide position and “y” equals the 3′ most nucleotide position of the polynucleotide; and further where “x” equals an integer between 1 and the number of nucleotides of the gOBG3 polynucleotide sequence of the present invention minus 18, and where “y” equals an integer between 9 and the number of nucleotides of the gOBG3 polynucleotide sequence of the present invention; and where “x” is an integer smaller than “y” by at least 18. Every combination of “x” and “y” positions are included as specific embodiments of the invention. Moreover, the formula “x” to “y” may be modified as ‘“x1-x2” to “y1-y2”’, wherein “x1-x2” and “y1-y2” represent positional ranges selected from any two nucleotide positions of the sequence listing. Alternative formulas include ‘“x1-x2” to “y”’ and ‘“x” to “y1-y2”’.

These specific embodiments, and other polynucleotide fragment embodiments described herein may be modified as being “at least”, “equal to”, “equal to or less than”, “less than”, “at least ______ but not greater than ______” or “from ______ to ______”. a specified size or specified 5′ and/or 3′ positions.

The present invention also provides for the exclusion of any species of polynucleotide fragments of the present invention specified by 5′ and 3′ positions or polynucleotides specified by size in nucleotides as described above. Any number of fragments specified by 5′ and 3′ positions or by size in nucleotides, as described above, may be excluded.

Variants

In other preferred embodiments, variants of OBG3 and gOBG3 polynucleotides encoding OBG3 and gOBG3 fragments are envisioned. Variants of polynucleotides, as the term is used herein, are polynucleotides whose sequence differs from a reference polynucleotide. A variant of a polynucleotide may be a naturally occurring variant such as a naturally occurring allelic variant, or it may be a variant that is not known to occur naturally. Such non-naturally occurring variants of the polynucleotide may be made by mutagenesis techniques, including those applied to polynucleotides, cells or organisms. Generally, differences are limited so that the nucleotide sequences of the reference and the variant are closely similar overall and, in many regions, identical.

Polynucleotide variants that comprise a sequence substantially different from those described above but that, due to the degeneracy of the genetic code, still encode OBG3 and gOBG3 polypeptide fragments of the present invention are also specifically envisioned. It would also be routine for one skilled in the art to generate the degenerate variants described above, for instance, to optimize codon expression for a particular host (e.g., change codons in the human mRNA to those preferred by other mammalian or bacterial host cells).

As stated above, variant polynucleotides may occur naturally, such as a natural allelic variant, or by recombinant methods. By an “allelic variant” is intended one of several alternate forms of a gene occupying a given locus on a chromosome of an organism (See, e.g., B. Lewin, (1990) Genes IV, Oxford University Press, New York). Non-naturally occurring variants may be produced using art-known mutagenesis techniques. Such nucleic acid variants include those produced by nucleotide substitutions, deletions, or additions. The substitutions, deletions, or additions may involve one or more nucleotides. Alterations in the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of an OBG3 or gOBG3 polypeptide fragment of the invention. Also preferred in this regard are conservative substitutions.

Nucleotide changes present in a variant polynucleotide are preferably silent, which means that they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes may also result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence.

In cases where the nucleotide substitutions result in one or more amino acid changes, preferred OBG3 and gOBG3 polypeptide fragments include those that retain one or more obesity-related activity as described in Section I of the Preferred Embodiments of the Invention.

By “retain the same activities” is meant that the activity measured using the polypeptide encoded by the variant OBG3 or gOBG3 polynucleotide in assays is at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100%, and not more than 101%, 102%, 103%, 104%, 105%, 110%, 115%, 120% or 125% of the activity measured using a gOBG3 fragment described in the Examples Section herein.

By the activity being “increased” is meant that the activity measured using the polypeptide encoded by the variant OBG3 or gOBG3 polynucleotide in assays is at least 125%, 130%, 135%, 140%, 145%, 150%, 155%, 160%, 170%, 180%, 190%, 200%, 225%, 250%, 275%, 300%, 325%, 350%, 375%, 400%, 450%, or 500% of the activity measured using a gOBG3 fragment described in the Examples Section herein.

By the activity being “decreased” is meant that the activity measured using the polypeptide encoded by the variant OBG3 or gOBG3 polynucleotide in assays is decreased by at least 25%, 30%, 35%, 40%, 45%, or 50% of the activity measured using a gOBG3 fragment described in the Examples Section herein

Percent Identity

The present invention is further directed to nucleic acid molecules having sequences at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to the polynucleotide sequences of SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5 or fragments thereof that encode a polypeptide having obesity-related activity as described in Section I of the Preferred Embodiments of the Invention. Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a large number of the nucleic acid molecules at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in SEQ ID NO:1, SEQ ID NO:3, or SEQ ID NO:5 or fragments thereof will encode a polypeptide having biological activity. In fact, since degenerate variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even without performing the above described comparison assay. It will be further recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will also encode a polypeptide having biological activity. This is because the skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to significantly affect protein function (e.g., replacing one aliphatic amino acid with a second aliphatic amino acid), as further described previously in Section I of the Preferred Embodiments of the Invention.

By a polynucleotide having a nucleotide sequence at least, for example, 95% “identical” to a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the OBG3 or gOBG3 fragment. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted, inserted, or substituted with another nucleotide. The query sequence may be an entire sequence or any fragment specified as described herein.

The methods of determining and defining whether any particular nucleic acid molecule or polypeptide is at least 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the present invention can be done by using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al., ((1990) Comput Appl Biosci. July;6(3):237-45). In a sequence alignment the query and subject sequences are both DNA sequences. An RNA sequence can be compared by first converting U's to T's. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty−1, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the length of the subject nucleotide sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence because of 5′ or 3′ deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for 5′ and 3′ truncations of the subject sequence when calculating percent identity. For subject sequences truncated at the 5′ or 3′ ends, relative to the query sequence, the percent identity is corrected by calculating the number of bases of the query sequence that are 5′ and 3′ of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score is what is used for the purposes of the present invention. Only nucleotides outside the 5′ and 3′ nucleotides of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for the purposes of manually adjusting the percent identity score.

For example, a 90-nucleotide subject sequence is aligned to a 100-nucleotide query sequence to determine percent identity. The deletions occur at the 5′ end of the subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 nucleotides at 5′ end. The 10 unpaired nucleotides represent 10% of the sequence (number of nucleotides at the 5′ and 3′ ends not matched/total number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly matched the final percent identity would be 90%.

In another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5′ or 3′ of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 5′ and 3′ of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are made for the purposes of the present invention.

Fusions

Further included in the present invention are polynucleotides encoding the polypeptides of the present invention that are fused in frame to the coding sequences for additional heterologous amino acid sequences. Also included in the present invention are nucleic acids encoding polypeptides of the present invention together with additional, non-coding sequences, including for example, but not limited to non-coding 5′ and 3′ sequences, vector sequence, sequences used for purification, probing, or priming. For example, heterologous sequences include transcribed, nontranslated sequences that may play a role in transcription, and mRNA processing, for example, ribosome binding and stability of mRNA. The heterologous sequences may alternatively comprise additional coding sequences that provide additional functionalities. Thus, a nucleotide sequence encoding a polypeptide may be fused to a tag sequence, such as a sequence encoding a peptide that facilitates purification of the fused polypeptide. In certain preferred embodiments of this aspect of the invention, the tag amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, Calif., 91311), among others, many of which are commercially available. For instance, hexa-histidine provides for convenient purification of the fusion protein (See, Gentz et al., (1989) Proc Natl Acad Sci USA Feburary;86(3):821-4). The “HA” tag is another peptide useful for purification which corresponds to an epitope derived from the influenza hemagglutinin protein (See, Wilson et al., (1984) Cell 37(3):767-78). As discussed above, other such fusion proteins include OBG3 or gOBG3 fragment cDNA fused to Fc at the N- or C-terminus.

III. Recombinant Vectors of the Invention

The term “vector” is used herein to designate either a circular or a linear DNA or RNA molecule, that is either double-stranded or single-stranded, and that comprises at least one polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or multicellular host organism.

The present invention relates to recombinant vectors comprising any one of the polynucleotides described herein.

The present invention encompasses a family of recombinant vectors that comprise polynucleotides encoding OBG3 polypeptide fragments of the invention.

In a first preferred embodiment, a recombinant vector of the invention is used to amplify the inserted polynucleotide in a suitable cell host, this polynucleotide being amplified every time that the recombinant vector replicates. The inserted polynucleotide can be one that encodes gOBG3 polypeptide fragments of the invention.

A second preferred embodiment of the recombinant vectors according to the invention consists of expression vectors comprising polynucleotides encoding OBG3 polypeptide fragments of the invention. Within certain embodiments, expression vectors are employed to express an OBG3 fragment of the invention, preferably a modified OBG3 fragment described in the present invention, which can be then purified and, for example, be used as a treatment for obesity-related diseases, or simply to reduce body mass of individuals.

Expression requires that appropriate signals are provided in the vectors, said signals including various regulatory elements, such as enhancers/promoters from both viral and mammalian sources, that drive expression of the genes of interest in host cells. Dominant drug selection markers for establishing permanent, stable, cell clones expressing the products are generally included in the expression vectors of the invention, as they are elements that link expression of the drug selection markers to expression of the polypeptide.

More particularly, the present invention relates to expression vectors which include nucleic acids encoding an OBG3 fragment of the invention, or a modified OBG3 fragment as described herein, or variants or fragments thereof, under the control of a regulatory sequence selected among OBG3 polypeptide fragments, or alternatively under the control of an exogenous regulatory sequence.

Consequently, preferred expression vectors of the invention are selected from the group consisting of: (a) an OBG3 fragment regulatory sequence and driving the expression of a coding polynucleotide operably linked thereto; and (b) an OBG3 fragment coding sequence of the invention, operably linked to regulatory sequences allowing its expression in a suitable cell host and/or host organism.

Some of the elements which can be found in the vectors of the present invention are described in further detail in the following sections.

1) General features of the expression vectors of the invention:

A recombinant vector according to the invention comprises, but is not limited to, a YAC (Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a phagemid, a cosmid, a plasmid, or even a linear DNA molecule which may consist of a chromosomal, non-chromosomal, semi-synthetic or synthetic DNA. Such a recombinant vector can comprise a transcriptional unit comprising an assembly of:

(1) a genetic element or elements having a regulatory role in gene expression, for example promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from about 10 to 300 bp in length that act on the promoter to increase the transcription;

(2) a structural or coding sequence which is transcribed into mRNA and eventually translated into a polypeptide, said structural or coding sequence being operably linked to the regulatory elements described in (1); and

(3) appropriate transcription initiation and termination sequences. Structural units intended for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, when a recombinant protein is expressed without a leader or transport sequence, it may include a N-terminal residue. This residue may or may not be subsequently cleaved from the expressed recombinant protein to provide a final product.

Generally, recombinant expression vectors will include origins of replication, selectable markers permitting transformation of the host cell, and a promoter derived from a highly expressed gene to direct transcription of a downstream structural sequence. The heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, and preferably a leader sequence capable of directing secretion of the translated protein into the periplasmic space or the extracellular medium. In a specific embodiment wherein the vector is adapted for transfecting and expressing desired sequences in mammalian host cells, preferred vectors will comprise an origin of replication in the desired host, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation sites, splice donor and acceptor sites, transcriptional termination sequences, and 5′-flanking non-transcribed sequences. DNA sequences derived from the SV40 viral genome, for example SV40 origin, early promoter, enhancer, splice and polyadenylation sites may be used to provide the required non-transcribed genetic elements.

2) Regulatory elements

Promoters

The suitable promoter regions used in the expression vectors of the present invention are chosen taking into account the cell host in which the heterologous gene is expressed. The particular promoter employed to control the expression of a nucleic acid sequence of interest is not believed to be important, so long as it is capable of directing the expression of the nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is preferable to position the nucleic acid coding region adjacent to and under the control of a promoter that is capable of being expressed in a human cell, such as, for example, a human or a viral promoter. The promoter used may be constitutive or inducible.

A suitable promoter may be heterologous with respect to the nucleic acid for which it controls the expression or alternatively can be endogenous to the native polynucleotide containing the coding sequence to be expressed. Additionally, the promoter is generally heterologous with respect to the recombinant vector sequences within which the construct promoter/coding sequence has been inserted.

Promoter regions can be selected from any desired gene using, for example, CAT (chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. Preferred bacterial promoters are the LacI, LacZ, the T3 or T7 bacteriophage RNA polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin promoter, or the p10 protein promoter from baculovirus (Kit Novagen) (Smith et al., (1983) Mol Cell Biol December;3(12):2156-65; O'Reilly et al., 1992), the lambda PR promoter or also the trc promoter.

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-L. In addition, promoters specific for a particular cell type may be chosen, such as those facilitating expression in adipose tissue, muscle tissue, or liver. Selection of a convenient vector and promoter is well within the level of ordinary skill in the art.

The choice of a promoter is well within the ability of a person skilled in the field of genetic engineering. For example, one may refer to Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY, Vol. 1, 2, 3 (1989), or also to the procedures described by Fuller et al. (1996) Immunology in Current Protocols in Molecular Biology.

Other Regulatory Elements

Where a cDNA insert is employed, one will typically desire to include a polyadenylation signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation signal is not believed to be crucial to the successful practice of the invention, and any such sequence may be employed such as human growth hormone and SV40 polyadenylation signals. Also contemplated as an element of the expression cassette is a terminator. These elements can serve to enhance message levels and to minimize read through from the cassette into other sequences.

Vectors containing the appropriate DNA sequence as described above can be utilized to transform an appropriate host to allow the expression of the desired polypeptide or polynucleotide.

3) Selectable Markers

Such markers would confer an identifiable change to the cell permitting easy identification of cells containing the expression construct. The selectable marker genes for selection of transformed host cells are preferably dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, TRP1 for S. cerevisiae or tetracycline, rifampicin or ampicillin resistance in E. coli, or levan saccharase for mycobacteria, this latter marker being a negative selection marker.

4) Preferred vectors

Bacterial Vectors

As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable marker and a bacterial origin of replication derived from commercially available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and pGEM1 (Promega Biotec, Madison, Wis., USA).

Large numbers of other suitable vectors are known to those of skill in the art, and are commercially available, such as the following bacterial vectors : pQE70, pQE60, pQE-9 (Qiagen), pbs, pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE-30 (QIAexpress).

Baculovirus Vectors

A suitable vector for the expression of polypeptides of the invention is a baculovirus vector that can be propagated in insect cells and in insect cell lines. A specific suitable host vector system is the pVL1392/1393 baculovirus transfer vector (Pharmingen) that is used to transfect the SF9 cell line (ATCC No CRL 1711) which is derived from Spodoptera frugiperda.

Other suitable vectors for the expression of an Apm1 globular head polypeptide in a baculovirus expression system include those described by Chai et al. (1993; Biotechnol Appl Biochem. December;18 (Pt 3):259-73); Vlasak et al. (1983; Eur J Biochem September 1;135(1): 123-6); and Lenhard et al. (1996; Gene March 9;169(2): 187-90).

Viral Vectors

In one specific embodiment, the vector is derived from an adenovirus. Preferred adenovirus vectors according to the invention are those described by Feldman and Steg (1996; Semin Interv Cardiol September;1(3):203-8) or Ohno et al. (1994; Science August 5;265(5173):781-4). Another preferred recombinant adenovirus according to this specific embodiment of the present invention is the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an adenovirus of animal origin (French patent application No. FR-93.05954).

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in vivo, particularly to mammals, including humans. These vectors provide efficient delivery of genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host.

Particularly preferred retroviruses for the preparation or construction of retroviral in vitro or in vivo gene delivery vehicles of the present invention include retroviruses selected from the group consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis virus and Rous Sarcoma virus. Particularly preferred Murine Leukemia Viruses include the 4070A and the 1504A viruses, Abelson (ATCC No VR-999), Friend (ATCC No VR-245), Gross (ATCC No VR-590), Rauscher (ATCC No VR-998) and Moloney Murine Leukemia Virus (ATCC No VR-190; PCT Application No WO 94/24298). Particularly preferred Rous Sarcoma Viruses include Bryan high titer (ATCC Nos VR-334, VR-657, VR-726, VR-659 and VR-728). Other preferred retroviral vectors are those described in Roth et al. (1996), PCT Application No WO 93/25234, PCT Application No WO 94/06920, Roux et al., ((1989) Proc Natl Acad Sci U S A December;86(23):9079-83), Julan et al., (1992) J. Gen. Virol. 3:3251-3255 and Neda et al., ((1991) J Biol Chem August 5;266(22):14143-6).

Yet another viral vector system that is contemplated by the invention consists of the adeno-associated virus (AAV). The adeno-associated virus is a naturally occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient replication and a productive life cycle (Muzyczka et al., (1992) Curr Top Microbiol Immunol;158:97-129). It is also one of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration (Flotte et al., (1992) Am J Respir Cell Mol Biol September;7(3):349-56; Samulski et al., (1989) J Virol September;63(9):3822-8; McLaughlin et al., (1989) Am. J. Hum. Genet. 59:561-569). One advantageous feature of AAV derives from its reduced efficacy for transducing primary cells relative to transformed cells.

5) Delivery of the Recombinant Vectors

In order to effect expression of the polynucleotides of the invention, these constructs must be delivered into a cell. This delivery may be accomplished in vitro, as in laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain disease states.

One mechanism is viral infection where the expression construct is encapsulated in an infectious viral particle.

Several non-viral methods for the transfer of polynucleotides into cultured mammalian cells are also contemplated by the present invention, and include, without being limited to, calcium phosphate precipitation (Graham et al., (1973) Virology August;54(2):536-9; Chen et al., (1987) Mol Cell Biol August;7(8):2745-52), DEAE-dextran (Gopal, (1985) Mol Cell Biol May;5(5):1188-90), electroporation (Tur-Kaspa et al., (1986) Mol Cell Biol Feberary;6(2):716-8; Potter et al., (1984) Proc Natl Acad Sci USA November;81(22):7161-5.), direct microinjection (Harland et al., (1985) J Cell Biol September;101(3):1094-9), DNA-loaded liposomes (Nicolau et al., (1982) Biochim Biophys Acta October 11;721(2):185-90; Fraley et al., (1979) Proc Natl Acad Sci USA July;76(7):3348 -52), and receptor-mediated transfection (Wu and Wu, (1987) J Biol Chem April 5;262(10):4429 -32; Wu and Wu (1988) Biochemistry February 9;27(3):887-92). Some of these techniques may be successfully adapted for in vivo or ex vivo use.

Once the expression polynucleotide has been delivered into the cell, it may be stably integrated into the genome of the recipient cell. This integration may be in the cognate location and orientation via homologous recombination (gene replacement) or it may be integrated in a random, non specific location (gene augmentation). In yet further embodiments, the nucleic acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or “episomes” encode sequences sufficient to permit maintenance and replication independent of or in synchronization with the host cell cycle.

One specific embodiment for a method for delivering a protein or peptide to the interior of a cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a physiologically acceptable carrier and a naked polynucleotide operatively coding for the polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the naked polynucleotide is taken up into the interior of the cell and has a physiological effect. This is particularly applicable for transfer in vitro but it may be applied to in vivo as well.

Compositions for use in vitro and in vivo comprising a “naked” polynucleotide are described in PCT application No. WO 90/11092 (Vical Inc.) and also in PCT application No. WO 95/11307 (Institut Pasteur, INSERM, Université d'Ottawa) as well as in the articles of Tascon et al. (1996) Nature Medicine. 2(8):888-892 and of Huygen et al. ((1996) Nat Med August;2(8):893-8).

In still another embodiment of the invention, the transfer of a naked polynucleotide of the invention, including a polynucleotide construct of the invention, into cells may be proceeded with a particle bombardment (biolistic), said particles being DNA-coated microprojectiles accelerated to a high velocity allowing them to pierce cell membranes and enter cells without killing them, such as described by Klein et al. ((1990) Curr Genet February;17(2):97-103).

In a further embodiment, the polynucleotide of the invention may be entrapped in a liposome (Ghosh and Bacchawat, (1991) Targeted Diagn Ther;4:87-103; Wong et al., (1980) Gene 10:87-94; Nicolau et al., (1987) Methods Enzymol.;149:157-76). These liposomes may further be targeted to cells expressing LSR by incorporating leptin, triglycerides, Acrp30, or other known LSR ligands into the liposome membrane.

In a specific embodiment, the invention provides a composition for the in vivo production of an Apm1 globular head polypeptide described herein. It comprises a naked polynucleotide operatively coding for this polypeptide, in solution in a physiologically acceptable carrier, and suitable for introduction into a tissue to cause cells of the tissue to express the said polypeptide.

The amount of vector to be injected to the desired host organism varies according to the site of injection. As an indicative dose, it will be injected between 0.1 and 100 μg of the vector in an animal body, preferably a mammal body, for example a mouse body.

In another embodiment of the vector according to the invention, it may be introduced in vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated and more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has been transformed with the vector coding for the desired Apm1 globular head polypeptide or the desired fragment thereof is reintroduced into the animal body in order to deliver the recombinant protein within the body either locally or systemically.

IV. Recombinant Cells of the Invention

Another object of the invention consists of host cells recombinant for, i.e., that have been transformed or transfected with one of the polynucleotides described herein, and more precisely a polynucleotide comprising a polynucleotide encoding an OBG3 polypeptide fragment of the invention such as any one of those described in “Polynucleotides of the Invention”. These polynucleotides can be present in cells as a result of transient or stable transfection. The invention includes host cells that are transformed (prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant vector such as any one of those described in “Recombinant Vectors of the Invention”.

Generally, a recombinant host cell of the invention comprises at least one of the polynucleotides or the recombinant vectors of the invention that are described herein.

Preferred host cells used as recipients for the recombinant vectors of the invention are the following:

a) Prokaryotic host cells: Escherichia coli strains (I.E. DH5-α strain), Bacillus subtilis, Salmonella typhimurium, and strains from species like Pseudomonas, Streptomyces and Staphylococcus, and

b) Eukaryotic host cells: HeLa cells (ATCC No CCL2; No CCL2.1; No CCL2.2), Cv 1 cells (ATCC No CCL70), COS cells (ATCC No CRL1650; No CRL1651), Sf-9 cells (ATCC No CRL1711), C127 cells (ATCC No CRL-1804), 3T3 (ATCC No CRL-6361), CHO (ATCC No CCL-61), human kidney 293 (ATCC No 45504; No CRL-1573), BHK (ECACC No 84100501;No 84111301), PLC cells, HepG2, and Hep3B.

The constructs in the host cells can be used in a conventional manner to produce the gene product encoded by the recombinant sequence.

Following transformation of a suitable host and growth of the host to an appropriate cell density, the selected promoter is induced by appropriate means, such as temperature shift or chemical induction, and cells are cultivated for an additional period.

Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.

Microbial cells employed in the expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Such methods are well known by the skilled artisan.

Further, according to the invention, these recombinant cells can be created in vitro or in vivo in an animal, preferably a mammal, most preferably selected from the group consisting of mice, rats, dogs, pigs, sheep, cattle, and primates, not to include humans. Recombinant cells created in vitro can also be later surgically implanted in an animal, for example. Methods to create recombinant cells in vivo in animals are well-known in the art.

The present invention also encompasses primary, secondary, and immortalized homologously recombinant host cells of vertebrate origin, preferably mammalian origin and particularly human origin, that have been engineered to: a) insert exogenous (heterologous) polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete endogenous chromosomal DNA, and/or c) replace endogenous chromosomal DNA with exogenous polynucleotides. Insertions, deletions, and/or replacements of polynucleotide sequences may be to the coding sequences of the targeted gene and/or to regulatory regions, such as promoter and enhancer sequences, operably associated with the targeted gene.

The present invention further relates to a method of making a homologously recombinant host cell in vitro or in vivo, wherein the expression of a targeted gene not normally expressed in the cell is altered. Preferably the alteration causes expression of the targeted gene under normal growth conditions or under conditions suitable for producing the polypeptide encoded by the targeted gene. The method comprises the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, the polynucleotide construct comprising; (i) a targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions appropriate for homologous recombination.

The present invention further relates to a method of altering the expression of a targeted gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, the polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions appropriate for homologous recombination, thereby producing a homologously recombinant cell; and (c) maintaining the homologously recombinant cell in vitro or in vivo under conditions appropriate for expression of the gene.

The present invention further relates to a method of making a polypeptide of the present invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting the cell in vitro with a polynucleotide construct, the polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a transfected cell; (b) maintaining the transfected cell in vitro or in vivo under conditions appropriate for homologous recombination, thereby producing a homologously recombinant cell; and c) maintaining the homologously recombinant cell in vitro or in vivo under conditions appropriate for expression of the gene thereby making the polypeptide.

The present invention further relates to a polynucleotide construct that alters the expression of a targeted gene in a cell type in which the gene is not normally expressed. This occurs when a polynucleotide construct is inserted into the chromosomal DNA of the target cell, wherein the polynucleotide construct comprises: a) a targeting sequence; b) a regulatory sequence and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further included are polynucleotide constructs, as described above, wherein the construct further comprises a polynucleotide which encodes a polypeptide and is in-frame frame with the targeted endogenous gene after homologous recombination with chromosomal DNA.

The compositions may be produced, and methods performed, by techniques known in the art, such as those described in U.S. Patent Nos: U.S. Pat. Nos. 6,054,288; 6,048,729; 6,048,724; 6,048,524; 5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 5,580,734; International Publication Nos:WO96/29411, WO 94/12650; and scientific articles described by Koller et al., (1994) Annu. Rev. Immunol. 10:705-730; the disclosures of each of which are incorporated by reference in their entireties).

The OBG3 gene expression in mammalian, and typically human, cells may be rendered defective, or alternatively it may be enhanced, with the insertion of an OBG3 genomic or cDNA sequence with the replacement of the OBG3 gene counterpart in the genome of an animal cell by an OBG3 polynucleotide according to the invention. These genetic alterations may be generated by homologous recombination events using specific DNA constructs that have been previously described.

One kind of host cell that may be used are mammalian zygotes, such as murine zygotes. For example, murine zygotes may undergo microinjection with a purified DNA molecule of interest, for example a purified DNA molecule that has previously been adjusted to a concentration range from 1 ng/ml—for BAC inserts—3 ng/μl—for P1 bacteriophage inserts—in 10 mM Tris-HCl, pH 7.4, 250 μM EDTA containing 100 mM NaCl, 30 μM spermine, and 70 μM spermidine. When the DNA to be microinjected has a large size, polyamines and high salt concentrations can be used in order to avoid mechanical breakage of this DNA, as described by Schedl et al. ((1993) Nature March 18;362(6417):258-61).

Any one of the polynucleotides of the invention, including the DNA constructs described herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES cell line. ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of pre-implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC No.CRL-1821), ES-D3 (ATCC No.CRL1934 and No. CRL-11632), YS001 (ATCC No. CRL-11776), 36.5 (ATCC No. CRL-11116). To maintain ES cells in an uncommitted state, they are cultured in the presence of growth inhibited feeder cells which provide the appropriate signals to preserve this embryonic phenotype and serve as a matrix for ES cell adherence. Preferred feeder cells are primary embryonic fibroblasts that are established from tissue of day 13-day 14 embryos of virtually any mouse strain, that are maintained in culture, such as described by Abbondanzo et al. (1993; Methods Enzymol;225:803-23) and are inhibited in growth by irradiation, such as described by Robertson ((1987) Embryo-derived stem cell lines. In: E. J. Robertson Ed. Teratocarcinomas and embrionic stem cells: a practical approach. IRL Press, Oxford), or by the presence of an inhibitory concentration of LIF, such as described by Pease and Williams (1990; Exp Cell Res. October;190(2):209-11).

The constructs in the host cells can be used in a conventional manner to produce the gene product encoded by the recombinant sequence.

Following transformation of a suitable host and growth of the host to an appropriate cell density, the selected promoter is induced by appropriate means, such as temperature shift or chemical induction, and cells are cultivated for an additional period. Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification. Microbial cells employed in the expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Such methods are well known by the skilled artisan.

IV. Transgenic Animals

The present invention also provides methods and compositions for the generation of non-human animals and plants that express recombinant OBG3 polypeptides, i.e. recombinant OBG3 fragments or full-length OBG3 polypeptides. The animals or plants can be transgenic, i.e. each of their cells contains a gene encoding the OBG3 polypeptide, or, alternatively, a polynucleotide encoding the polypeptide can be introduced into somatic cells of the animal or plant, e.g. into mammary secretory epithelial cells of a mammal. In preferred embodiments, the non-human animal is a mammal such as a cow, sheep, goat, pig, or rabbit.

Methods of making transgenic animals such as mammals are well known to those of skill in the art, and any such-method can be used in the present invention. Briefly, transgenic mammals can be produced, e.g., by transfecting a pluripotential stem cell such as an ES cell with a polynucleotide encoding a polypeptide of interest. Successfully transformed ES cells can then be introduced into an early stage embryo which is then implanted into the uterus of a mammal of the same species. In certain cases, the transformed (“transgenic”) cells will comprise part of the germ line of the resulting animal, and adult animals comprising the transgenic cells in the germ line can then be mated to other animals, thereby eventually producing a population of transgenic animals that have the transgene in each of their cells, and which can stably transmit the transgene to each of their offspring. Other methods of introducing the polynucleotide can be used, for example introducing the polynucleotide encoding the polypeptide of interest into a fertilized egg or early stage embryo via microinjection. Alternatively, the transgene may be introduced into an animal by infection of zygotes with a retrovirus containing the transgene (Jaenisch, R. (1976) Proc. Natl. Acad. Sci. USA 73, 1260-1264). Methods of making transgenic mammals are described, e.g., in Wall et al. (1992) J Cell Biochem 1992 June;49(2):113-20; Hogan, et al. (1986) in Manipulating the mouse embryo. A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; in WO 91/08216, or in U.S. Pat. No. 4,736,866.

In a preferred method, the polynucleotides ares microinjected into the fertilized oocyte. Typically, fertilized oocytes are microinjected using standard techniques, and then cultured in vitro until a “pre-implantation embryo” is obtained. Such pre-implantation embryos preferably contain approximately 16 to 150 cells. Methods for culturing fertilized oocytes to the pre-implantation stage are described, e.g., by Gordon et al. ((1984) Methods in Enzymology, 101, 414); Hogan et al. ((1986) in Manipulating the mouse embryo. A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y) (for the mouse embryo); Hammer et al. ((1985) Nature, 315, 680) (for rabbit and porcine embryos); Gandolfi et al. ((1987) J. Reprod. Fert. 81, 23-28); Rexroad et al. ((1988) J. Anim. Sci. 66, 947-953) (for ovine embryos); and Eyestone et al. ((1989) J. Reprod. Fert. 85, 715-720); Camous et al. ((1984) J. Reprod. Fert. 72, 779-785); and Heyman et al. ((1987) Theriogenology 27, 5968) (for bovine embryos); the disclosures of each of which are incorporated herein in their entireties. Pre-implantation embryos are then transferred to an appropriate female by standard methods to permit the birth of a transgenic or chimeric animal, depending upon the stage of development when the transgene is introduced.

As the frequency of transgene incorporation is often low, the detection of transgene integration in pre-implantation embryos is often desirable using any of the herein-described methods. Any of a number of methods can be used to detect the presence of a transgene in a pre-implantation embryo. For example, one or more cells may be removed from the pre-implantation embryo, and the presence or absence of the transgene in the removed cell or cells can be detected using any standard method e.g. PCR. Alternatively, the presence of a transgene can be detected in utero or post partum using standard methods.

In a particularly preferred embodiment of the present invention, transgenic mammals are generated that secrete recombinant OBG3 polypeptides in their milk. As the mammary gland is a highly efficient protein-producing organ, such methods can be used to produce protein concentrations in the gram per liter range, and often significantly more. Preferably, expression in the mammary gland is accomplished by operably linking the polynucleotide encoding the OBG3 polypeptide to a mammary gland specific promoter and, optionally, other regulatory elements. Suitable promoters and other elements include, but are not limited to, those derived from mammalian short and long WAP, alpha, beta, and kappa, casein, alpha and beta lactoglobulin, beta-CN 5′ genes, as well as the the mouse mammary tumor virus (MMTV) promoter. Such promoters and other elements may be derived from any mammal, including, but not limited to, cows, goats, sheep, pigs, mice, rabbits, and guinea pigs. Promoter and other regulatory sequences, vectors, and other relevant teachings are provided, e.g., by Clark (1998) J Mammary Gland Biol Neoplasia 3:337-50; Jost et al. (1999) Nat. Biotechnol 17:160-4; U.S. Pat. Nos. 5,994,616; 6,140,552; 6,013,857; Sohn et al. (1999) DNA Cell Biol. 18:845-52; Kim et al. (1999) J. Biochem. (Japan) 126:320-5; Soulier et al. (1999) Euro. J. Biochem. 260:533-9; Zhang et al. (1997) Chin. J. Biotech. 13:271-6; Rijnkels et al. (1998) Transgen. Res. 7:5 -14; Korhonen et al. (1997) Euro. J. Biochem. 245:482-9; Uusi-Oukari et al. (1997) Transgen. Res. 6:75-84; Hitchin et al. (1996) Prot. Expr. Purif. 7:247-52; Platenburg et al. (1994) Transgen. Res. 3:99-108; Heng-Cherl et al. (1993) Animal Biotech. 4:89-107; and Christa et al. (2000) Euro. J. Biochem. 267:1665-71; the entire disclosures of each of which is herein incorporated by reference.

In another embodiment, the polypeptides of the invention can be produced in milk by introducing polynucleotides encoding the polypeptides into somatic cells of the mammary gland in vivo, e.g. mammary secreting epithelial cells. For example, plasmid DNA can be infused through the nipple canal, e.g. in association with DEAE-dextran (see, e.g., Hens et al. (2000) Biochim. Biophys. Acta 1523:161-171), in association with a ligand that can lead to receptor-mediated endocytosis of the construct (see, e.g., Sobolev et al. (1998) 273:7928-33), or in a viral vector such as a retroviral vector, e.g. the Gibbon ape leukemia virus (see, e.g., Archer et al. (1994) PNAS 91:6840-6844). In any of these embodiments, the polynucleotide may be operably linked to a mammary gland specific promoter, as described above, or, alternatively, any strongly expressing promoter such as CMV or MoMLV LTR.

The suitability of any vector, promoter, regulatory element, etc. for use in the present invention can be assessed beforehand by transfecting cells such as mammary epithelial cells, e.g. MacT cells (bovine mammary epithelial cells) or GME cells (goat mammary epithelial cells), in vitro and assessing the efficiency of transfection and expression of the trans gene in the cells.

For in vivo administration, the polynucleotides can be administered in any suitable formulation, at any of a range of concentrations (e.g. 1-500 μg/ml, preferably 50-100 μg/ml), at any volume (e.g. 1-100 ml, preferably 1 to 20 ml), and can be administered any number of times (e.g. 1, 2, 3, 5, or 10 times), at any frequency (e.g. every 1, 2, 3, 5, 10, or any number of days). Suitable concentrations, frequencies, modes of administration, etc. will depend upon the particular polynucleotide, vector, animal, etc., and can readily be determined by one of skill in the art.

In a preferred embodiment, a retroviral vector such as as Gibbon ape leukemia viral vector is used, as described in Archer et al. ((1994) PNAS 91:6840-6844). As retroviral infection typically requires cell division, cell division in the mammary glands can be stimulated in conjunction with the administration of the vector, e.g. using a factor such as estrodiol benzoate, progesterone, reserpine, or dexamethasone. Further, retroviral and other methods of infection can be facilitated using accessory compounds such as polybrene.

In any of the herein-described methods for obtaining OBG3 polypeptides from milk, the quantity of milk obtained, and thus the quantity of OBG3 polypeptides produced, can be enhanced using any standard method of lacation induction, e.g. using hexestrol, estrogen, and/or progesterone.

The polynucleotides used in such embodiments can either encode a full-length OBG3 polypeptide or an OBG3 fragment. Typically, the encoded polypeptide will include a signal sequence to ensure the secretion of the protein into the milk. Where a full length OBG3 sequence is used, the full length protein can, e.g., be isolated from milk and cleaved in vitro using a suitable protease. Alternatively, a second, protease-encoding polynucleotide can be introduced into the animal or into the mammary gland cells, whereby expression of the protease results in the cleavage of the OBG3 polypeptide in vivo, thereby allowing the direct isolation of OBG3 fragments from milk.

V. Pharmaceutical or Physiologically Acceptable Compositions of the Invention

The OBG3 and gOBG3 polypeptide fragments of the invention can be administered to non-human animals and/or humans, alone or in pharmaceutical or physiologically acceptable compositions where they are mixed with suitable carriers or excipient(s). The pharmaceutical or physiologically acceptable composition is then provided at a therapeutically effective dose. A therapeutically effective dose refers to that amount of OBG3 or gOBG3 fragment sufficient to result in prevention or amelioration of symptoms or physiological status of obesity-related diseases or disorders as determined by the methods described herein. A therapeutically effective dose can also refer to the amount of OBG3 or gOBG3 fragment necessary for a reduction in weight or a prevention of an increase in weight or prevention of an increase in the rate of weight gain in persons desiring this affect for cosmetic reasons. A therapeutically effective dosage of an OBG3 or gOBG3 fragment of the invention is that dosage that is adequate to promote weight loss or weight gain with continued periodic use or administration. Techniques for formulation and administration of OBG3 polypeptide fragments may be found in “Remington's Pharmaceutical Sciences,” Mack Publishing Co., Easton, Pa., latest edition.

Other diseases or disorders that OBG3 polypeptide fragments of the invention could be used to treat or prevent include, but are not limited to, obesity and obesity-related diseases and disorders such as obesity, insulin resistance, atherosclerosis, atheromatous disease, heart disease, hypertension, stroke, Syndrome X, Noninsulin Dependent Diabetes Mellitus (NIDDM, or Type II diabetes) and Insulin Dependent Diabetes Mellitus (IDDM or Type I diabetes). Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy, and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, anorexia, and bulimia. The OBG3 or gOBG3 polypeptide fragments may also be used to enhance physical performance during work or exercise or enhance a feeling of general well-being. Physical performance activities include walking, running, jumping, lifting and/or climbing.

The OBG3 or gOBG3 polypeptide fragments or antagonists thereof may also be used to treat dyslexia, attention-deficit disorder (ADD), attention-deficit/hyperactivity disorder (ADHD), and psychiatric disorders such as schizophrenia by modulating fatty acid metabolism, more specifically, the production of certain long-chain polyunsaturated fatty acids.

It is expressly considered that the OBG3 or gOBG3 polypeptide fragments of the invention may be provided alone or in combination with other pharmaceutically or physiologically acceptable compounds. Other compounds useful for the treatment of obesity and other diseases and disorders are currently well-known in the art.

In a preferred embodiment, the OBG3 or gOBG3 polypeptide fragments are useful for, and used in, the treatment of insulin resistance and diabetes using methods described herein and known in the art. More particularly, a preferred embodiments relates to process for the therapeutic modification and regulation of glucose metabolism in an animal or human subject, which comprises administering to a subject in need of treatment (alternatively on a timed daily basis) an OBG or OBG3 polypeptide fragment (or polynucleotide encoding said polypeptide) in dosage amount and for a period sufficient to reduce plasma glucose levels in said animal or human subject.

Further preferred embodiments relate to methods for the prophylaxis or treatment of diabetes comprising administering to a subject in need of treatment (alternatively on a timed daily basis) an OBG or OBG3 polypeptide fragment (or polynucleotide encoding said polypeptide) in dosage amount and for a period sufficient to reduce plasma glucose levels in said animal or human subject.

Routes of Administration.

Suitable routes of administration include oral, nasal, rectal, transmucosal, or intestinal administration, parenteral delivery, including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, intrapulmonary (inhaled) or intraocular injections using methods known in the art. A particularly useful method of administering compounds for promoting weight loss involves surgical implantation, for example into the abdominal cavity of the recipient, of a device for delivering OBG3or gOBG3 polypeptide fragments over an extended period of time. Other particularly preferred routes of administration are aerosol and depot formulation. Sustained release formulations, particularly depot, of the invented medicaments are expressly contemplated.

Composition/Formulation

Pharmaceutical or physiologically acceptable compositions and medicaments for use in accordance with the present invention may be formulated in a conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries. Proper formulation is dependent upon the route of administration chosen.

Certain of the medicaments described herein will include a pharmaceutically or physiologically acceptable acceptable carrier and at least one polypeptide that is a OBG3 polypeptide fragment of the invention. For injection, the agents of the invention may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer such as a phosphate or bicarbonate buffer. For transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.

Pharmaceutical or physiologically acceptable preparations that can be taken orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture with fillers such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for oral administration should be in dosages suitable for such administration.

For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner.

For administration by inhalation, the compounds for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a nebulizer, with the use of a suitable gaseous propellant, e.g., carbon dioxide. In the case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or insufflator, may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch.

The compounds may be formulated for parenteral administration by injection, e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The compositions may take such forms as suspensions, solutions or emulsions in aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.

Pharmaceutical or physiologically acceptable formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble form. Aqueous suspensions may contain substances that increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents that increase the solubility of the compounds to allow for the preparation of highly concentrated solutions.

Alternatively, the active ingredient may be in powder or lyophilized form for constitution with a suitable vehicle, such as sterile pyrogen-free water, before use.

In addition to the formulations described previously, the compounds may also be formulated as a depot preparation. Such long acting formulations may be administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.

Additionally, the compounds may be delivered using a sustained-release system, such as semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various sustained release materials have been established and are well known by those skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the compounds for a few weeks up to over 100 days.

Depending on the chemical nature and the biological stability of the therapeutic reagent, additional strategies for protein stabilization may be employed.

The pharmaceutical or physiologically acceptable compositions also may comprise suitable solid or gel phase carriers or excipients. Examples of such carriers or excipients include but are not limited to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and polymers such as polyethylene glycols.

Effective Dosage.

Pharmaceutical or physiologically acceptable compositions suitable for use in the present invention include compositions wherein the active ingredients are contained in an effective amount to achieve their intended purpose. More specifically, a therapeutically effective amount means an amount effective to prevent development of or to alleviate the existing symptoms of the subject being treated. Determination of the effective amounts is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.

For any compound used in the method of the invention, the therapeutically effective dose can be estimated initially from cell culture assays. For example, a dose can be formulated in animal models to achieve a circulating concentration range that includes or encompasses a concentration point or range shown to increase leptin or lipoprotein uptake or binding in an in vitro system. Such information can be used to more accurately determine useful doses in humans.

A therapeutically effective dose refers to that amount of the compound that results in amelioration of symptoms in a patient. Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50, (the dose lethal to 50% of the test population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio between LD50 and ED50. Compounds that exhibit high therapeutic indices are preferred.

The data obtained from these cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED50, with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. (See, e.g., Fingl et al., 1975, in “The Pharmacological Basis of Therapeutics”, Ch. 1).

Dosage amount and interval may be adjusted individually to provide plasma levels of the active compound which are sufficient to maintain or prevent weight loss or gain, depending on the particular situation. Dosages necessary to achieve these effects will depend on individual characteristics and route of administration.

Dosage intervals can also be determined using the value for the minimum effective concentration. Compounds should be administered using a regimen that maintains plasma levels above the minimum effective concentration for 10-90% of the time, preferably between 30-90%; and most preferably between 50-90%. In cases of local administration or selective uptake, the effective local concentration of the drug may not be related to plasma concentration.

The amount of composition administered will, of course, be dependent on the subject being treated, on the subject's weight, the severity of the affliction, the manner of administration and the judgment of the prescribing physician.

A preferred dosage range for the amount of an OBG3 polypeptide fragment of the invention, which can be administered on a daily or regular basis to achieve desired results, including a reduction in levels of circulating plasma triglyceride-rich lipoproteins, range from 0.01-0.5 mg/kg body mass. A more preferred dosage range is from 0.05-0.1 mg/kg. Of course, these daily dosages can be delivered or administered in small amounts periodically during the course of a day. It is noted that these dosage ranges are only preferred ranges and are not meant to be limiting to the invention.

VI. Methods of Treatment

Treatment of mice with gOBG3 polypeptide fragments results in decreased triglyceride levels, decreased free fatty acid levels, decreased glucose levels, and decreased body weight as well as increased muscle oxidation.

The invention is drawn inter alia to methods of preventing or treating obesity-related diseases and disorders comprising providing an individual in need of such treatment with an OBG3 or gOBG3 polypeptide fragment of the invention. Preferably, the OBG3 polypeptide fragment has obesity-related activity either in vitro or in vivo. Preferably the OBG3 polypeptide fragment is provided to the individual in a pharmaceutical composition that is preferably taken orally. Preferably the individual is a mammal, and most preferably a human. In preferred embodiments, the obesity-related disease or disorder is selected from the group consisting of atherosclerosis, cardiovascular disease, insulin resistance, hypertension, stroke, Syndrome X, Type I diabetes, Type II diabetes and lipoatrophic diabetes. Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related disorders to be treated by compounds of the invention include hyperlipidemia, hypercholesterolemia, hypertriglyceridemia, and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, neoplasia-related weight loss, anorexia, and bulimia. In highly preferred embodiments, OBG3 polypeptide polypeptide fragments in pharmaceutical compositions are used to modulate body weight in healthy individuals for cosmetic reasons.

The invention also features a method of preventing or treating obesity-related diseases and disorders comprising providing an individual in need of such treatment with a compound identified by assays of the invention (described in Section VI of the Preferred Embodiments of the Invention and in the Examples). Preferably these compounds antagonize or agonize effects of OBG3 or gOBG3 polypeptide fragments in cells in vitro, muscles ex vivo, or in animal models. Alternatively, these compounds agonize or antagonize the effects of OBG3 or gOBG3 polypeptide fragments on leptin and/or lipoprotein uptake and/or binding. Optionally, these compounds prevent the interaction, binding, or uptake of OBG3 or gOBG3 polypeptide fragments with LSR in vitro or in vivo. Preferably, the compound is provided to the individual in a pharmaceutical composition that is preferably taken orally. Preferably the individual is a mammal, and most preferably a human. In preferred embodiments, the obesity-related disease or disorder is selected from the group consisting of obesity and obesity-related diseases and disorders such as atherosclerosis, heart disease, insulin resistance, hypertension, stroke, Syndrome X, Type I diabetes, Type II diabetes, and lipoatrophic diabetes. Diabetes-related complications to be treated by the methods of the invention include microangiopathic lesions, ocular lesions, retinopathy, neuropathy and renal lesions. Heart disease includes, but is not limited to, cardiac insufficiency, coronary insufficiency, and high blood pressure. Other obesity-related related disorders to be treated by compounds of the invention include hyperlipidemia, hypercholesterolemia, hypertriglyceridemia, and hyperuricemia. Yet other obesity-related diseases or disorders of the invention include cachexia, wasting, AIDS-related weight loss, neoplasia-related weight loss, anorexia, and bulimia. In highly preferred embodiments, the pharmaceutical compositions are used to modulate body weight for cosmetic reasons.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance, in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance, in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control blood glucose in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance, alone, without combination of insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to control body weight in some individuals, particularly those with Type II diabetes or insulin resistance, alone, without combination of insulin therapy.

In a further preferred embodiment, the present invention may be used in complementary therapy, particularly in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance, to improve their weight or glucose control in combination with an insulin secretagogue or an insulin sensitising agent. Preferably, the insulin secretagogue is 1,1-dimethyl-2-(2-morpholino phenyl)guanidine fumarate (BTS67582) or a sulphonylurea selected from tolbutamide, tolazamide, chlorpropamide, glibenclamide, glimepiride, glipizide and glidazide. Preferably, the insulin sensitising agent is selected from metformin, ciglitazone, troglitazone and pioglitazone.

The present invention further provides a method of improving the body weight or glucose control of some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance,alone, without an insulin secretagogue or an insulin sensitising agent.

In a further preferred embodiment, the present invention may be administered either concomitantly or concurrently, with an insulin secretagogue or an insulin sensitising agent for example in the form of separate dosage units to be used simultaneously, separately or sequentially (either before or after the secretagogue or either before or after the sensitising agent). Accordingly, the present invention further provides for a composition of pharmaceutical or physiologically acceptable composition and an insulin secretagogue or an insulin sensitising agent as a combined preparation for simultaneous, separate or sequential use for the improvement of body weight or glucose control in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition further provides a method for the use as an insulin sensitiser.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to improve insulin sensitivity in some individuals, particularly those with Type I diabetes, Type II diabetes, or insulin resistance, in combination with insulin therapy.

In further preferred embodiments, the present invention of said pharmaceutical or physiologically acceptable composition can be used as a method to improve insulin sensitivity in some individuals, particularly those with Type II diabetes or insulin resistance,without insulin therapy.

More generally, the instant invention is drawn to treatment with OBG3 and gOBG3 polypeptide fragments where an individual is shown to have a particular genotype for an Apm1 marker (Apm1 designates the human homolog of the full-length OBG3 polypeptide), or where they have been shown to have a reduced amount of plasma Apm1, either full-length or preferably a more biologically active fragment of Apm1, as compared to control values, e.g. values representative of non-diseased individuals, or as compared to that individual prior to the onset of a disease or condition. In either case, treatment comprises providing pharmaceutically acceptable gOBG3 or OBG3 polypeptide fragments to the individual. The exact amount of OBG3 or gOBG3 fragment provided would be determined through clinical trials under the guidance of qualified physicians, but would be expected to be in the range of 5-7 mg per individual per day. In general, a preferred range would be from 0.5 to 14 mg per individual per day, with a highly preferred range being between 1 and 10 mg per individual per day. Individuals who could benefit from treatment with gOBG3 or OBG3 polypeptide fragments could be identified through at least two methods: plasma serum level determinations and genotyping.

OBG3/apM1 Levels

Preliminary studies have shown that obese people have lower levels of full-length OBG3/Apm1 than non-obese people. The invention envisions treatment of individuals (preferably obese) that have low levels of full-length OBG3/Apm1 with OBG3 or gOBG3 polypeptide fragments of the invention. In addition, the invention preferably is drawn to treatment of individuals with low levels of the biologically active fragment of OBG3/Apm1 with OBG3 or gOBG3 polypeptide fragments of the invention. In further embodiments, OBG3 or OBG3 polypeptide fragments of the present invention are administered to individuals, preferably obese individuals, that levels of full-length OBG3 (or alternatively a mature OBG3 polypeptide fragment) at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, about 100% or 100% lower than non-obese individuals, preferably healthy individuals as determined by a physician using normal standards in the art. Methods to determine and compare the levels of full-length OBG3 in individuals are well-known in the art and include, but are not limited to using an antibody specific for Apm1 in a format such as a Radio Immune Assay, ELISA, Western blot, dotblot, or as part of an array, for example. Methods of generating antibodies to, and detection of, Apm1 and fragments thereof (such as the 27 kDa apM1. product in human plasma) as well as to proteins with SNPs are included in the present invention and are discussed in PCT/IB99/01858, U.S. application Ser. No. 09/434,848, and WO 99/07736, hereby incorporated herein by reference in its entirety including and drawings, figures, or tables. Further, antibodies specific for OBG3/gOBG3 polypeptide fragments of the invention (e.g., antibodies that bind the 27 kDa apM1. fragment as well as full length apM1. and antibodies that bind the 27 kDa apM1. fragment but not the full length apM1), their generation, and their use are described herein. Particularly included in the present invention are methods of binding an antibody specific for the 27 kDa apM1. fragment to the 27 kDa apM1 fragment comprising the steps of: obtaining a biological sample comprising the 27 kDa apM1. fragment, contacting said 27 kDa fragment with said antibody under condition that allow specific binding of the antibody and the 27 kDa fragment to occur, and specifically binding said antibody to said 27 kDa fragment.

apM1 Genotyping

The methods treatment using genotyping to identify individuals that would benefit from treatments of the invention are based on the finding that single nucleotide polymorphisms (SNPs) in the Apm1 gene have been identified that show an association in obese adolescents with free fatty acid (FFA) and respiratory quotient levels, others that show an association with the relationship between BMI and leptin, and still others that show an association with glucose levels. Further, a combination of the Apm1 SNPs associated with FFA and leptin metabolism also predict people who will be seriously overweight (data not shown).

Apm1 SNPs and methods of genotyping are described in PCT/IB99/01858 as well as U.S. application Ser. No. 09/434,848, both of which are hereby incoporated herein in their entirety including any drawings, figure, or tables. Briefly, the term “genotype” as used herein refers to the identity of the alleles present in an individual or a sample. The term “genotyping” a sample or an individual for a biallelic marker consists of determining the specific allele or the specific nucleotide carried by an individual at a biallelic marker.

Methods of genotyping comprise determining the identity of a nucleotide at an apM1 biallelic marker site by any method known in the art. Preferably, microsequencing is used. The genotype is used to determine whether an individual should be treated with gOBG3 or OBG3 polypeptide fragments. Thus, these genotyping methods are performed on nucleic acid samples derived from a single individual. These methods are well-known in the art, and discussed fully in the applications referenced above and briefly below.

Any method known in the art can be used to identify the nucleotide present at a biallelic marker site. Since the biallelic marker allele to be detected has been identified and specified in the present invention, detection will prove simple for one of ordinary skill in the art by employing any of a number of techniques. Many genotyping methods require the previous amplification of the DNA region carrying the biallelic marker of interest. While the amplification of target or signal is often preferred at present, ultrasensitive detection methods that do not require amplification are also encompassed by the present genotyping methods.

Methods well-known to those skilled in the art that can be used to detect biallelic polymorphisms include methods such as conventional dot blot analysis, single strand conformational polymorphism analysis (SSCP; Orita et al. (1989) Proc Natl Acad Sci USA April;86(8):2766-70), denaturing gradient gel electrophoresis (DGGE), heteroduplex analysis, mismatch cleavage detection, and other conventional techniques as described in Sheffield et al. (1991; Am J Hum Genet October;49(4):699-706); White et al. (1992), Grompe et al. ((1989) Proc Natl Acad Sci USA August;86(15):5888-92; (1993) Nat Genet. October;5(2): 111-7). Another method for determining the identity of the nucleotide present at a particular polymorphic site employs a specialized exonuclease-resistant nucleotide derivative as described in U.S. Pat. No. 4,656,127.

Preferred methods involve directly determining the identity of the nucleotide present at a biallelic marker site by sequencing assay, allele-specific amplification assay, or hybridization assay. The following is a description of some preferred methods. A highly preferred method is the microsequencing technique. The term “sequencing” is used herein to refer to polymerase extension of duplex primer/template complexes and includes both traditional sequencing and microsequencing.

1) Sequencing Assays

The nucleotide present at a polymorphic site can be determined by sequencing methods. In a preferred embodiment, DNA samples are subjected to PCR amplification before sequencing using any method known in the art. Preferably, the amplified DNA is subjected to automated dideoxy terminator sequencing reactions using a dye-primer cycle sequencing protocol. Sequence analysis allows the identification of the base present at the biallelic marker site.

2) Microsequencing Assays

In microsequencing methods, the nucleotide at a polymorphic site in a target DNA is detected by a single nucleotide primer extension reaction. This method involves appropriate microsequencing primers that hybridize just upstream of the polymorphic base of interest in the target nucleic acid. A polymerase is used to specifically extend the 3′ end of the primer with one single ddNTP (chain terminator) complementary to the nucleotide at the polymorphic site. The identity of the incorporated nucleotide is then determined in any suitable way.

Typically, microsequencing reactions are carried out using fluorescent ddNTPs and the extended microsequencing primers are analyzed by electrophoresis on ABI 377 sequencing machines to determine the identity of the incorporated nucleotide as described in EP 412 883. Alternatively capillary electrophoresis can be used in order to process a higher number of assays simultaneously.

Different approaches can be used for the labeling and detection of ddNTPs. A homogeneous phase detection method based on fluorescence resonance energy transfer has been described by Chen and Kwok ((1997) Nucleic Acids Res. January 15;25(2):347-53) and Chen et al. ((1997) Proc Natl Acad Sci USA September 30;94(20):10756-61). In this method, amplified genomic DNA fragments containing polymorphic sites are incubated with a 5′-fluorescein-labeled primer in the presence of allelic dye-labeled dideoxyribonucleoside triphosphates and a modified Taq polymerase. The dye-labeled primer is extended one base by the dye-terminator specific for the allele present on the template. At the end of the genotyping reaction, the fluorescence intensities of the two dyes in the reaction mixture are analyzed directly without separation or purification. All these steps can be performed in the same tube and the fluorescence changes can be monitored in real time. Alternatively, the extended primer may be analyzed by MALDI-TOF Mass Spectrometry. The base at the polymorphic site is identified by the mass added onto the microsequencing primer (see Haff and Smirnov, (1997) Nucleic Acids Res. September 15;25(18):3749-50; (1997) Genome Res. April;7(4):378-88).

Microsequencing may be achieved by the established microsequencing method or by developments or derivatives thereof. Alternative methods include several solid-phase microsequencing techniques. The basic microsequencing protocol is the same as described previously, except that the method is conducted as a heterogeneous phase assay, in which the primer or the target molecule is immobilized or captured onto a solid support. To simplify the primer separation and the terminal nucleotide addition analysis, oligonucleotides are attached to solid supports or are modified in such ways that permit affinity separation as well as polymerase extension. The 5′ ends and internal nucleotides of synthetic oligonucleotides can be modified in a number of different ways to permit different affinity separation approaches, e.g., biotinylation. If a single affinity group is used on the oligonucleotides, the oligonucleotides can be separated from the incorporated terminator regent. This eliminates the need of physical or size separation. More than one oligonucleotide can be separated from the terminator reagent and analyzed simultaneously if more than one affinity group is used. This permits the analysis of several nucleic acid species or more nucleic acid sequence information per extension reaction. The affinity group need not be on the priming oligonucleotide but could alternatively be present on the template.

For example, immobilization can be carried out via an interaction between biotinylated DNA and streptavidin-coated microtitration wells or avidin-coated polystyrene particles. In the same manner, oligonucleotides or templates may be attached to a solid support in a high-density format. In such solid phase microsequencing reactions, incorporated ddNTPs can be radiolabeled (Syvänen, (1994) Clin Chim Acta. May;226(2):225-36) or linked to fluorescein (Livak and Hainer, (1994) Hum Mutat.;3(4):379-85). The detection of radiolabeled ddNTPs can be achieved through scintillation-based techniques. The detection of fluorescein-linked ddNTPs can be based on the binding of antifluorescein antibody conjugated with alkaline phosphatase, followed by incubation with a chromogenic substrate (such as p-nitrophenyl phosphate).

Other possible reporter-detection pairs include: ddNTP linked to dinitrophenyl (DNP) and anti-DNP alkaline phosphatase conjugate (Harju et al., (1993) Clin Chem. November;39(11 Pt 1):2282-7) or biotinylated ddNTP and horseradish peroxidase-conjugated streptavidin with o-phenylenediamine as a substrate (WO 92/15712). As yet another alternative solid-phase microsequencing procedure, Nyren et al. ((1993) Anal Biochem. January;208(1):171-5). Described a method relying on the detection of DNA polymerase activity by an enzymatic luminometric inorganic pyrophosphate detection assay (ELIDA).

Pastinen et al. ((1997) Genome Res. June;7(6):606-14) describe a method for multiplex detection of single nucleotide polymorphism in which the solid phase minisequencing principle is applied to an oligonucleotide array format. High-density arrays of DNA probes attached to a solid support (DNA chips) are further described below.

It will be appreciated that any primer having a 3′ end immediately adjacent to the polymorphic nucleotide may be used. Similarly, it will be appreciated that microsequencing analysis may be performed for any biallelic marker or any combination of biallelic markers of the present invention.

3) Allele-Specific Amplification Assay Methods

Discrimination between the two alleles of a biallelic marker can also be achieved by allele specific amplification, a selective strategy, whereby one of the alleles is amplified without, or at a much higher rate than, amplification of the other allele. This is accomplished by placing the polymorphic base at the 3′ end of one of the amplification primers. Because the extension forms from the 3′end of the primer, a mismatch at or near this position has an inhibitory effect on amplification. Therefore, under appropriate amplification conditions, these primers only direct amplification on their complementary allele. Determining the precise location of the mismatch and the corresponding assay conditions are well with the ordinary skill in the art.

The “Oligonucleotide Ligation Assay” (OLA) uses two oligonucleotides which are designed to be capable of hybridizing to abutting sequences of a single strand of a target molecules. One of the oligonucleotides is biotinylated, and the other is detectably labeled. If the precise complementary sequence is found in a target molecule, the oligonucleotides will hybridize such that their termini abut, and create a ligation substrate that can be captured and detected. OLA is capable of detecting single nucleotide polymorphisms and may be advantageously combined with PCR as described by Nickerson et al. ((1990) Proc Natl Acad Sci USA November;87(22):8923-7). In this method, PCR is used to achieve the exponential amplification of target DNA, which is then detected using OLA.

Other amplification methods which are particularly suited for the detection of single nucleotide polymorphism include LCR (ligase chain reaction) and Gap LCR (GLCR). LCR uses two pairs of probes to exponentially amplify a specific target. The sequences of each pair of oligonucleotides are selected to permit the pair to hybridize to abutting sequences of the same strand of the target. Such hybridization forms a substrate for a template-dependant ligase. In accordance with the present invention, LCR can be performed with oligonucleotides having the proximal and distal sequences of the same strand of a biallelic marker site.

In one embodiment, either oligonucleotide will be designed to include the biallelic marker site. In such an embodiment, the reaction conditions are selected such that the oligonucleotides can be ligated together only if the target molecule either contains or lacks the specific nucleotide that is complementary to the biallelic marker on the oligonucleotide.

In an alternative embodiment, the oligonucleotides will not include the biallelic marker, such that when they hybridize to the target molecule, a “gap” is created as described in WO 90/01069. This gap is then “filled” with complementary dNTPs (as mediated by DNA polymerase), or by an additional pair of oligonucleotides. Thus at the end of each cycle, each single strand has a complement capable of serving as a target during the next cycle and exponential allele-specific amplification of the desired sequence is obtained.

Ligase/Polymerase-mediated Genetic Bit Analysis™ is another method for determining the identity of a nucleotide at a preselected site in a nucleic acid molecule (WO 95/21271). This method involves the incorporation of a nucleoside triphosphate that is complementary to the nucleotide present at the preselected site onto the terminus of a primer molecule, and their subsequent ligation to a second oligonucleotide. The reaction is monitored by detecting a specific label attached to the reaction's solid phase or by detection in solution.

4) Hybridization Assay Methods

A preferred method of determining the identity of the nucleotide present at a biallelic marker site involves nucleic acid hybridization. The hybridization probes, which can be conveniently used in such reactions, preferably include probes specific for Apm1 cDNA surrounding Apm1 biallelic markers. Any hybridization assay may be used including Southern hybridization, Northern hybridization, dot blot hybridization and solid-phase hybridization (see Sambrook et al., supra).

Hybridization refers to the formation of a duplex structure by two single stranded nucleic acids due to complementary base pairing. Hybridization can occur between exactly complementary nucleic acid strands or between nucleic acid strands that contain minor regions of mismatch. Specific probes can be designed that hybridize to one form of a biallelic marker and not to the other and therefore are able to discriminate between different allelic forms. Allele-specific probes are often used in pairs, one member of a pair showing perfect match to a target sequence containing the original allele and the other showing a perfect match to the target sequence containing the alternative allele.

Hybridization conditions should be sufficiently stringent that there is a significant difference in hybridization intensity between alleles, and preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles. Stringent, sequence specific hybridization conditions, under which a probe will hybridize only to the exactly complementary target sequence are well known in the art (Sambrook et al., supra). Stringent conditions are sequence dependent and will be different in different circumstances. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Although such hybridizations can be performed in solution, it is preferred to employ a solid-phase hybridization assay. The target DNA comprising a biallelic marker of the present invention may be amplified prior to the hybridization reaction.

The presence of a specific allele in the sample is determined by detecting the presence or the absence of stable hybrid duplexes formed between the probe and the target DNA. The detection of hybrid duplexes can be carried out by a number of methods. Various detection assay formats are well known which utilize detectable labels bound to either the target or the probe to enable detection of the hybrid duplexes. Typically, hybridization duplexes are separated from unhybridized nucleic acids and the labels bound to the duplexes are then detected. Those skilled in the art will recognize that wash steps may be employed to wash away excess target DNA or probe as well as unbound conjugate. Further, standard heterogeneous assay formats are suitable for detecting the hybrids using the labels present on the primers and probes.

Two recently developed assays allow hybridization-based allele discrimination with no need for separations or washes (see Landegren U. et al., (1998) Genome Res. August;8(8):769-76). The TaqMan assay takes advantage of the 5′ nuclease activity of Taq DNA polymerase to digest a DNA probe annealed specifically to the accumulating amplification product. TaqMan probes are labeled with a donor-acceptor dye pair that interacts via fluorescence resonance energy transfer (FRET). Cleavage of the TaqMan probe by the advancing polymerase during amplification dissociates the donor dye from the quenching acceptor dye, greatly increasing the donor fluorescence. All reagents necessary to detect two allelic variants can be assembled at the beginning of the reaction and the results are monitored in real time (see Livak et al., 1995).

In an alternative homogeneous hybridization based procedure, molecular beacons are used for allele discriminations. Molecular beacons are hairpin-shaped oligonucleotide probes that report the presence of specific nucleic acids in homogeneous solutions. When they bind to their targets they undergo a conformational reorganization that restores the fluorescence of an internally quenched fluorophore (Tyagi et al., (1998) Nat Biotechnol. January;16(1):49-53).

The polynucleotides provided herein can be used to produce probes which can be used in hybridization assays for the detection of biallelic marker alleles in biological samples. These probes are characterized in that they preferably comprise between 8 and 50 nucleotides, and in that they are sufficiently complementary to a sequence comprising a biallelic marker of the present invention to hybridize thereto and preferably sufficiently specific to be able to discriminate the targeted sequence for only one nucleotide variation. A particularly preferred probe is 25 nucleotides in length. Preferably the biallelic marker is within 4 nucleotides of the center of the polynucleotide probe. In particularly preferred probes, the biallelic marker is at the center of said polynucleotide. In preferred embodiments the polymorphic base is within 5, 4, 3, 2, 1, nucleotides of the center of the said polynucleotide, more preferably at the center of said polynucleotide. Preferably the probes of the present invention are labeled or immobilized on a solid support.

By assaying the hybridization to an allele specific probe, one can detect the presence or absence of a biallelic marker allele in a given sample. High-Throughput parallel hybridizations in array format are specifically encompassed within “hybridization assays” and are described below.

5) Hybridization To Addressable Arrays Of Oligonucleotides

Hybridization assays based on oligonucleotide arrays rely on the differences in hybridization stability of short oligonucleotides to perfectly matched and mismatched target sequence variants. Efficient access to polymorphism information is obtained through a basic structure comprising high-density arrays of oligonucleotide probes attached to a solid support (e.g., the chip) at selected positions. Each DNA chip can contain thousands to millions of individual synthetic DNA probes arranged in a grid-like pattern and miniaturized to the size of a dime.

The chip technology has already been applied with success in numerous cases. For example, the screening of mutations has been undertaken in the BRCA1 gene, in S. cerevisiae mutant strains, and in the protease gene of HIV-1 virus (Hacia et al., (1996) Nat Genet. December;14(4):441-7; Shoemaker et al., (1996) Nat Genet December;14(4):450-6; Kozal et al., (1996) Nat Med. July;2(7):753-9). Chips of various formats for use in detecting biallelic polymorphisms can be produced on a customized basis by Affymetrix (GeneChip™), Hyseq (HyChip and HyGnostics), and Protogene Laboratories.

In general, these methods employ arrays of oligonucleotide probes that are complementary to target nucleic acid sequence segments from an individual, which target sequences include a polymorphic marker. EP 785280 describes a tiling strategy for the detection of single nucleotide polymorphisms.

Briefly, arrays may generally be “tiled” for a large number of specific polymorphisms. By “tiling” is generally meant the synthesis of a defined set of oligonucleotide probes which is made up of a sequence complementary to the target sequence of interest, as well as preselected variations of that sequence, e.g., substitution of one or more given positions with one or more members of the basis set of monomers, i.e. nucleotides. Tiling strategies are further described in PCT application No. WO 95/11995. In a particular aspect, arrays are tiled for a number of specific, identified biallelic marker sequences. In particular, the array is tiled to include a number of detection blocks, each detection block being specific for a specific biallelic marker or a set of biallelic markers.

For example, a detection block may be tiled to include a number of probes, which span the sequence segment that includes a specific polymorphism. To ensure probes that are complementary to each allele, the probes are synthesized in pairs differing at the biallelic marker. In addition to the probes differing at the polymorphic base, monosubstituted probes are also generally tiled within the detection block. These monosubstituted probes have bases at and up to a certain number of bases in either direction from the polymorphism, substituted with the remaining nucleotides (selected from A, T, G, C and U). Typically the probes in a tiled detection block will include substitutions of the sequence positions up to and including those that are 5 bases away from the biallelic marker. The monosubstituted probes provide internal controls for the tiled array, to distinguish actual hybridization from artefactual cross-hybridization. Upon completion of hybridization with the target sequence and washing of the array, the array is scanned to determine the position on the array to which the target sequence hybridizes. The hybridization data from the scanned array is then analyzed to identify which allele or alleles of the biallelic marker are present in the sample. Hybridization and scanning may be carried out as described in PCT application No. WO 92/10092 and WO 95/11995 and U.S. Pat. No. 5,424,186.

Thus, in some embodiments, the chips may comprise an array of nucleic acid sequences of fragments of about 15 nucleotides in length. In preferred embodiments the polymorphic base is within 5, 4, 3, 2, 1, nucleotides of the center of the said polynucleotide, more preferably at the center of said polynucleotide. In some embodiments, the chip may comprise an array of at least 2, 3, 4, 5, 6, 7, 8 or more of these polynucleotides.

6) Integrated Systems

Another technique, which may be used to analyze polymorphisms, includes multicomponent integrated systems, which miniaturize and compartmentalize processes such as PCR and capillary electrophoresis reactions in a single functional device. An example of such technique is disclosed in U.S. Pat. No. 5,589,136, which describes the integration of PCR amplification and capillary electrophoresis in chips.

Integrated systems can be envisaged mainly when microfluidic systems are used. These systems comprise a pattern of microchannels designed onto a glass, silicon, quartz, or plastic wafer included on a microchip. The movements of the samples are controlled by electric, electroosmotic or hydrostatic forces applied across different areas of the microchip to create functional microscopic valves and pumps with no moving parts. Varying the voltage controls the liquid flow at intersections between the micro-machined channels and changes the liquid flow rate for pumping across different sections of the microchip.

For genotyping biallelic markers, the microfluidic system may integrate nucleic acid amplification, microsequencing, capillary electrophoresis and a detection method such as laser-induced fluorescence detection.

In a first step, the DNA samples are amplified, preferably by PCR. Then, the amplification products are subjected to automated microsequencing reactions using ddNTPs (specific fluorescence for each ddNTP) and the appropriate oligonucleotide microsequencing primers which hybridize just upstream of the targeted polymorphic base. Once the extension at the 3′ end is completed, the primers are separated from the unincorporated fluorescent ddNTPs by capillary electrophoresis. The separation medium used in capillary electrophoresis can for example be polyacrylamide, polyethyleneglycol or dextran. The incorporated ddNTPs in the single-nucleotide primer extension products are identified by fluorescence detection. This microchip can be used to process at least 96 to 384 samples in parallel. It can use the usual four color laser induced fluorescence detection of the ddNTPs.

Apm1 Biallellic Markers

The apM1 biallelic markers currently identified are shown in Table 1 below. The markers that have been linked with either FFA levels or changes in the leptin/BMI index are A5, A6, A7 and A3, A4, respectively. A5, A6, and A7 are in complete linkage disequilibrium. Thus, if an individual's genotype at A6 is GG, then A7 will be AA, and both are linked with decreased FFA levels and would indicate that treatment with gOBG3 or OBG3 polypeptide fragments was appropriate for example. Similarly, if an individual's genotype at A4 is AC or CC, treatment with gOBG3 or OBG3 polypeptide fragments could be expected to be beneficial. Alternatively, if an individual has both an AA genotype at A7 and an AC or CC genotype at A4, treatment with gOBG3 or oBG3 polypeptide fragments is indicated.

The above-described associations between genotypes and risk factors and treatment are exemplary, only. Other associations that would also indicate individuals appropriate for gOBG3 or OBG3 fragment treatment (or inappropriate) can also be identified using the methods described in the art or PCT/IB99/01858, which disclosure is hereby incorporated by reference in its entirety. Associations that would indicate treatment would be those genotypes associated with changes in parameters that gOBG3 or OBG3 fragment administration has been shown to affect in a “positive” direction, e.g. the association with decrease in weight for treatment of obesity. Associations that would indicate that treatment should not be performed would be genotypes that indicated an adverse affect for diabetes treatment (negative effect on insulin levels for example) or weight loss.

TABLE 1 Biallelic Marker Localization in Polymor- Marker position Amplicon marker Name apM1 gene phism in SEQ ID No 7 9-27 A1 9-27/261 5'regulatory Allele 1:G  3787 region Allele 2:C 99-14387 A2 99- Intron 1 Allele 1:A 11118 14387/129 Allele 2:C 9-12 A3 9-12/48 Intron 1 Allele 1:T 15120 Allele 2:C 9-12 and A4 9-12/124 or Exon 2 Allele 1:T 15196 9-13 9-13/66 Allele 2:G 9-12 and A5 9-12/355 or Intron 2 Allele 1:G 15427 9-13 9-13/297 Allele 2:T 9-12 and A6 9-12/428 or Intron 2 Allele 1:A 15500 9-13 9-13/370 Allele 2:G 99-14405 A7 99- Intron 2 Allele 1:G 15863 14405/105 Allele 2:A 9-16 A8 9-16/189 Exon 3 Allele 1:A 17170 Allele 2: Del

Apm1 Association Studies

Association studies focus on population frequencies and rely on the phenomenon of linkage disequilibrium. Linkage disequilibrium is the deviation from random of the occurrence of pairs of specific alleles at different loci on the same chromosome. If a specific allele in a given gene is directly associated with a particular trait, its frequency will be statistically increased in an affected (trait positive) population, when compared to the frequency in a trait negative population or in a random control population. As a consequence of the existence of linkage disequilibrium, the frequency of all other alleles present in the haplotype carrying the trait-causing allele will also be increased in trait positive individuals compared to trait negative individuals or random controls. Therefore, association between the trait and any allele (specifically a biallelic marker allele) in linkage disequilibrium with the trait-causing allele will suffice to suggest the presence of a trait-related gene in that particular region.

Case-control populations can be genotyped for biallelic markers to identify associations that narrowly locate a trait causing allele, as any marker in linkage disequilibrium with one given marker associated with a trait will be associated with the trait. Linkage disequilibrium allows the relative frequencies in case-control populations of a limited number of genetic polymorphisms (specifically biallelic markers) to be analyzed as an alternative to screening all possible functional polymorphisms in order to find trait-causing alleles. Association studies compare the frequency of marker alleles in unrelated case-control populations, and represent powerful tools for the dissection of complex traits.

Case-Control Populations (Inclusion Criteria)

Population-based association studies do not concern familial inheritance, but compare the prevalence of a particular genetic marker, or a set of markers, in case-control populations. They are case-control studies based on comparison of unrelated case (affected or trait positive) individuals and unrelated control (unaffected, trait negative or random) individuals. Preferably, the control group is composed of unaffected or trait negative individuals. Further, the control group is ethnically matched to the case population. Moreover, the control group is preferably matched to the case-population for the main known confusion factor for the trait under study (for example age-matched for an age-dependent trait). Ideally, individuals in the two samples are paired in such a way that they are expected to differ only in their disease status. The terms “trait positive population”, “case population” and “affected population” are used interchangeably herein.

An important step in the dissection of complex traits using association studies is the choice of case-control populations (see, Lander and Schork, (1994) Science, September 30;265(5181):2037-48). A major step in the choice of case-control populations is the clinical definition of a given trait or phenotype. Any genetic trait may be analyzed by the association method proposed here by carefully selecting the individuals to be included in the trait positive and trait negative phenotypic groups. Four criteria are often useful: clinical phenotype, age at onset, family history and severity.

The selection procedure for continuous or quantitative traits (such as blood pressure for example) involves selecting individuals at opposite ends of the phenotype distribution of the trait under study, so as to include in these trait positive and trait negative populations individuals with non-overlapping phenotypes. Preferably, case-control populations consist of phenotypically homogeneous populations. Trait positive and trait negative populations consist of phenotypically uniform populations of individuals representing each between 1 and 98%, preferably between 1 and 80%, more preferably between 1 and 50%, and more preferably between 1 and 30%, most preferably between 1 and 20% of the total population under study, and preferably selected among individuals exhibiting non-overlapping phenotypes. The clearer the difference between the two trait phenotypes, the greater the probability of detecting an association with biallelic markers. The selection of those drastically different but relatively uniform phenotypes enables efficient comparisons in association studies and the possible detection of marked differences at the genetic level, provided that the sample sizes of the populations under study are significant enough.

In preferred embodiments, a first group of between 50 and 300 trait positive individuals, preferably about 100 individuals, are recruited according to their phenotypes. A similar number of trait negative individuals are included in such studies.

In the present invention, typical examples of inclusion criteria include obesity and disorders related to obesity as well as physiologic parameters associated with obesity, such as free fatty acid levels, glucose levels, insulin levels, leptin levels, triglyceride levels, free fatty acid oxidation levels, and weight loss.

Association Analysis

The general strategy to perform association studies using biallelic markers derived from a region carrying a candidate gene is to scan two groups of individuals (case-control populations) in order to measure and statistically compare the allele frequencies of the biallelic markers of the present invention in both groups.

If a statistically significant association with a trait is identified for at least one or more of the analyzed biallelic markers, one can assume that: either the associated allele is directly responsible for causing the trait (i.e. the associated allele is the trait causing allele), or more likely the associated allele is in linkage disequilibrium with the trait causing allele. The specific characteristics of the associated allele with respect to the candidate gene function usually give further insight into the relationship between the associated allele and the trait (causal or in linkage disequilibrium). If the evidence indicates that the associated allele within the candidate gene is most probably not the trait-causing allele but is in linkage disequilibrium with the real trait-causing allele, then the trait-causing allele can be found by sequencing the vicinity of the associated marker, and performing further association studies with the polymorphisms that are revealed in an iterative manner.

Association studies are usually run in two successive steps. In a first phase, the frequencies of a reduced number of biallelic markers from the candidate gene are determined in the trait positive and trait negative populations. In a second phase of the analysis, the position of the genetic loci responsible for the given trait is further refined using a higher density of markers from the relevant region. However, if the candidate gene under study is relatively small in length, as is the case for apM1, a single phase may be sufficient to establish significant associations.

Haplotype Analysis

As described above, when a chromosome carrying a disease allele first appears in a population as a result of either mutation or migration, the mutant allele necessarily resides on a chromosome having a set of linked markers: the ancestral haplotype. This haplotype can be tracked through populations and its statistical association with a given trait can be analyzed. Complementing single point (allelic) association studies with multi-point association studies also called haplotype studies increases the statistical power of association studies. Thus, a haplotype association study allows one to define the frequency and the type of the ancestral carrier haplotype. A haplotype analysis is important in that it increases the statistical power of an analysis involving individual markers.

In a first stage of a haplotype frequency analysis, the frequency of the possible haplotypes based on various combinations of the identified biallelic markers of the invention is determined. The haplotype frequency is then compared for distinct populations of trait positive and control individuals. The number of trait positive individuals, which should be, subjected to this analysis to obtain statistically significant results usually ranges between 30 and 300, with a preferred number of individuals ranging between 50 and 150. The same considerations apply to the number of unaffected individuals (or random control) used in the study. The results of this first analysis provide haplotype frequencies in case-control populations, for each evaluated haplotype frequency a p-value and an odds ratio are calculated. If a statistically significant association is found the relative risk for an individual carrying the given haplotype of being affected with the trait under study can be approximated.

Interaction Analysis

The biallelic markers of the present invention may also be used to identify patterns of biallelic markers associated with detectable traits resulting from polygenic interactions. The analysis of genetic interaction between alleles at unlinked loci requires individual genotyping using the techniques described herein. The analysis of allelic interaction among a selected set of biallelic markers with appropriate level of statistical significance can be considered as a haplotype analysis. Interaction analysis consists in stratifying the case-control populations with respect to a given haplotype for the first loci and performing a haplotype analysis with the second loci with each subpopulation.

VII. Assays for Identifying Modulators of OBG3 Polypeptide Fragment Activity

The invention features methods of screening for one or more compounds that modulate OBG3 or gOBG3 polypeptide fragment activity in cells, that includes providing potential compounds to be tested to the cells, and where modulation of an OBG3 polypeptide fragment effect or activity indicates the one or more compounds. Exemplary assays that may be used are described in the Examples 4, 7-9 and 11-14. To these assays would be added compounds to be tested for their inhibitory or stimulatory activity as compared to the effects of OBG3 polypeptide fragment alone. Other assays in which an effect is observed based on the addition of OBG3 polypeptide fragment can also be used to screen for modulators of OBG3 polypeptide fragment activity or effects of the presence of OBG3 polypeptide fragment on cells. The essential step is to apply an unknown compound and then to monitor an assay for a change from what is seen when only OBG3 polypeptide fragment is applied to the cell. A change is defined as something that is significantly different in the presence of the compound plus OBG3 polypeptide fragment compared to OBG3 polypeptide fragment alone. In this case, significantly different would be an “increase” or a “decrease” in a measurable effect of at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, or 75%.

The term “modulation” as used herein refers to a measurable change in an activity. Examples include, but are not limited to, lipolysis stimulated receptor (LSR) modulation, leptin modulation, lipoprotein modulation, plasma FFA levels, FFA oxidation, TG levels, glucose levels, and weight. These effects can be in vitro or preferably in vivo. Modulation of an activity can be either an increase or a decrease in the activity. Thus, LSR activity can be increased or decreased, leptin activity can be increased or decreased, and lipoprotein activity can be increased or decreased. Similarly, FFA, TG, and glucose levels (and weight) can be increased or decreased in vivo Free Fatty Acid oxidation can be increased or decreased in vivo or ex vivo.

By “LSR” activity is meant expression of LSR on the surface of the cell, or in a particular conformation, as well as its ability to bind, uptake, and degrade leptin and lipoprotein. By “leptin” activity is meant its binding, uptake and degradation by LSR, as well as its transport across a blood brain barrier, and potentially these occurrences where LSR is not necessarily the mediating factor or the only mediating factor. Similarly, by “lipoprotein” activity is meant its binding, uptake and degradation by LSR, as well as these occurrences where LSR is not necessarily the mediating factor or the only mediating factor. Exemplary assays are provided in Example 4, 7-9, and 11-14. These assay and other comparable assays can be used to determine/identify compounds that modulate OBG3 polypeptide fragment activity. In some cases it may be important to identify compounds that modulate some but not all of the OBG3 polypeptide fragment activities, although preferably all activities are modified.

The term “increasing” as used herein refers to the ability of a compound to increase an OBG3 polypeptide fragment activity in some measurable way compared to the effect of an OBG3 polypeptide fragment in its absence. As a result of the presence of the compound leptin binding and/or uptake might increase, for example, as compared to controls in the presence of the OBG3 polypeptide fragment alone. Preferably, an increase in activity is at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, or 75% compared to the level of activity in the presence of the OBG3 fragment .

Similarly, the term “decreasing” as used herein refers to the ability of a compound to decrease an activity in some measurable way compared to the effect of an OBG3 fragment in its absence. For example, the presence of the compound decreases the plasma concentrations of FFA, TG, and glucose in mice. Also as a result of the presence of a compound leptin binding and/or uptake might decrease, for example, as compared to controls in the presence of the OBG3 fragment alone. Preferably, an decrease in activity is at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, or 75% as compared to the level of activity in the presence of the OBG3 fragment alone.

The invention features a method for identifying a potential compound to modulate body mass in individuals in need of modulating body mass comprising: a) contacting a cell with a gOBG3 fragment and a candidate compound; b) detecting a result selected from the group consisting of LSR modulation, leptin modulation, lipoprotein modulation; FFA oxidation modulation; and c) wherein said result identifies said potential compound if said result differs from said result when said cell is contacted with the gOBG3 polypeptide fragment alone.

In preferred embodiments, said contacting further comprises a ligand of said LSR. Preferably said ligand is selected from the group consisting of cytokine, lipoprotein, free fatty acids, and Clq, and more preferably said cytokine is leptin, and most preferably said leptin is a leptin polypeptide fragment as described in U.S. Provisional application No. 60/155,506 hereby incorporated by reference herein in its entirety including any figures, drawings, or tables.

In other preferred embodiments, said OBG3 or gOBG3 polypeptide fragment is mouse or is human. In other preferred embodiments, said cell is selected from the group consisting of PLC, CHO-K1, Hep3B, and HepG2.

In yet other preferred embodiments, said lipoprotein modulation is selected from the group consisting of binding, uptake, and degradation. Preferably, said modulation is an increase in said binding, uptake, or degradation. Alternatively, said modulation is a decrease in said binding, uptake, or degradation.

In other preferred embodiments, leptin modulation is selected from the group consisting of binding, uptake, degradation, and transport. Preferably, said modulation is an increase in said binding, uptake, degradation, or transport. Alternatively, said modulation is a decrease in said binding, uptake, degradation, or transport. Preferably, said transport is across a blood-brain barrier.

In yet other preferred embodiments, said LSR modulation is expression on the surface of said cell. Preferably, said detecting comprises FACS, more preferably said detecting further comprises antibodies that bind specifically to said LSR, and most preferably said antibodies bind specifically to the carboxy terminus of said LSR. In still other preferred embodiments, said potential compound is selected from the group consisting of peptides, peptide libraries, non-peptide libraries, peptoids, fatty acids, lipoproteins, medicaments, antibodies, small molecules, and proteases. Other and proteases. Other characteristics and advantages of the invention are described in the Brief Description of the Figures and the Examples. These are meant to be exemplary only, and not to limit the invention in any way. Throughout this application, various publications, patents and published patent applications are cited. The disclosures of these publications, patents and published patent specifications referenced in this application are hereby incorporated by reference into the present disclosure.

EXAMPLES

The following Examples are provided for illustrative purposes and not as a means of limitation. One of ordinary skill in the art would be able to design equivalent assays and methods based on the disclosure herein all of which form part of the instant invention.

It should be noted that the term full-length OBG3 polypeptide used throughout the specification is intended to encompass the protein homologs Acrp30 [Scherer, et al., “A novel serum protein similar to C1q, produced exclusively in adipocytes”; J Biol Chem 270, 26746-26749 (1995)], AdipoQ [Hu, et al., “AdipoQ is a novel adipose-specific gene dysregulated in obesity”, J Biol Chem 271, 10697-10703 (1996)] and the human homolog apM1. [Maeda, et al., “cDNA cloning and expression of a novel adipose specific collagen-like factor, apM1 (AdiPose Most abundant Gene transcript 1)”, Biochem Biophys Res Commun 221, 286-289 (1996)] or GBP28 [Nakano, et al., “Isolation and characterization of GBP28, a novel gelatin-binding protein purified from human plasma”, J Biochem (Tokyo) 120, 803-812 (1996)]. OBG3 is also intended to encompass other homologs.

Example 1

Production of Recombinant OBG3

An exemplary method for generating recombinant OBG3 is given below. Although the method describes the production of the mouse analog, a person with skill in the art would be able to use the information provided to produce other OBG3 analogs, including but not limited to the human analog. An alignment of the amino acid sequences of the human (apM1) and mouse (AdipoQ and acrp30) OBG3 is shown in FIG. 1.

The recombinant OBG3 analog is cloned in pTRC His B (Invitrogen) between BamH1 and Xho1 (FIG. 2) and maintained in E. coli DH5-alpha. The sequence of the OBG3 insert corresponds to ACRP 30 genbank U37222 bases 88 to 791 except in position 382 where in #3 G replaces A found in ACRP 30 (V instead of M). The corresponding nucleotide in AdipoQ U49915 is G as in clone #3. The amino acid V is also conserved in the human sequence apM1. D45371.

Culture

Plate out bacteria in LB agar media containing 100 μg/mL ampicillin. Inoculate 1 colony into 5 mL media (no agar) at 37° C. overnight. Add 2 mL of this initial culture into 500 mL Erlenmeyer flasks containing 200 mL LB media and 100 μg/mL ampicillin. Incubate at 37° C. in an orbital shaker until the OD₆₀₀=0.2. Add IPTG to a final concentration of 1 mM (stock solution=1 M). Incubate at 37° C. overnight.

Lysis

Pellet the bacteria by centrifugation (Sorvall, 3500 rpm, 15 min, 4° C.) in a pre-weighed tube.

At 4° C. resuspend the pellet in 3 mL/g of lysis buffer

Add 40 μL/g PMSF 10 mM

Add 80 μL/g of lysozyme 10 mg/mL

Incubate 20 min on ice, shaking intermittently

Add 30 μL/g 10% sodium deoxycholate

Incubate at 37° C. until the lysate is viscous

Freeze in liquid Nitrogen and thaw at 37° C. three times

Sonicate 2×, 30 sec, 25% cycle, 2.5 power level

Centrifuge 30 min, 15000 rpm, 4° C.

Recover the supernatant

Note: The lysate can be stored frozen before or after the sonication step.

Batch Purification

1. Pack 1 mL of Probond resin (Invitrogen; 1 mlL=2 mL suspended gel) into a 5 mL column. Wash with 5 mL PBS.

2. Apply 5 mL bacterial supernatant to the 1 mL of gel. (If volume is very high, use several small columns.)

3. Wash with 24 mL phosphate buffer, pH 7.8, followed by a wash with 24 mL phosphate buffer, pH 6.

4. Elute with imidazole buffer and collect fractions of 1 mL.

5. Analyze fractions by OD at 280 nm or by SDS-PAGE (12.5%; dilution ½ in 2×sample buffer) under reducing conditions (100° C., 5 min)

6. Pool the fractions containing protein (usually fraction numbers 2-4 for concentrations of 0.8-1 mg/mL and fractions 1, 5 and 6 for concentrations of 0.2-0.4 mg/mL).

7. Dialyze thoroughly against 1×PBS, 24 mM ammonium bicarbonate or 50 mM Tris, pH 7.4 containing 250 nM NaCl. Concentrate by Speed-Vac if needed.

8. Analyze protein by the Lowry method.

9. Aliquot and store at −20° C.

Purification On Liquid Chromatogaphy System

1. Pack 5 mL of Probond resin into a 5 mL column.

2. Wash with 4 bed volumes of phosphate buffer pH 7.8, 1 mL/min.

3. Inject 25 mL lysate (filtered on 0.45μ or centrifuged at 3000 rpm, 30 min, 4° C., Beckman Allegra 6R) at 0.5 mL/min.

4. Wash with 4 bed volumes of phosphate buffer, pH 7.8 at 1 mL/min.

5. Wash with 12 bed volumes of phosphate buffer pH 5.5 at 1 mL/min.

6. Elute bound fraction with phosphate buffer, pH 5.5, containing 1 M imidazole at 1 mL/min.

7. Collect fractions, dialyze and analyze protein as described for batch purification, steps 7-9.

Example 2

Generation of Globular OBG3 by Enzymatic Cleavage

Incubate purified OBG3 (obtained as described above or through equivalent method) with acetylated Trypsin-Type V-S from Bovine Pancreas (Sigma E. C.=3.4.21.4) at 400 u/mg protein at 25° C. for 10 min.

Stop reaction by running the sample over a Poly-Prep Column (Biorad 731-1550) at +4° C. containing immobilized Trypsin inhibitor.

Collect 1.0 mL fractions. Determine protein concentration.

Pool the protein containing fractions and dialyze extensively against PBS using dialysis tubing with M.W. cutoff=10,000 da.

Concentrate on Amicon YM-10 Centricon Filter (Millipore, M. W. cutoff=10,000 da). Sterile filter.

Determine final protein concentration using Markwell's modified Lowry procedure (1981) or BCA protein assay (Pierce Chemical Co, Rockford, Ill.) and BSA as standard.

Check purity and efficiency of cleavage by SDS-PAGE analysis using a 4-20% gradient gel. The intact OBG3 migrates as a single band at approximately 37 kda apparently due to co-transcribed vector sequences attached to the histidine tag at the N-terminus of AdipoQ, and forms a dimer at 74 kDa. The cleaved OBG3 forms a band at approx. 18 kda (gOBG3). Additional degradation products, all smaller than 10 kda are also generated from the N-terminal region. These are separated from the desired 18 kda band by dialysis with semipermeable membranes with a MW cutoff of 10,000. The two potential cleavage sites for gOBG3 are shown in FIG. 3. The actual cleavage site has been identified as the one after amino acid 103 (amino acid 100 for human gOBG3 or apMl) (FIG. 7). That is, the N-terminus of the gOBG3 cleavage product is Lys 104 (Lys 101 for human gOBG3 or apM1).

Other enzymatic/proteolytic methods can also be used that yield similar products, e.g. clostripain. Other preferred enzymes would preferably cleave OBG3 at a site close to the junction between the collagen-like tail and the globular head (about amino acid 108 for human gOBG3 and about amino acid 111 for murine gOBG3), preferably permit the reaction to be easily stopped, preferably be easily removed using an immobilized inhibitor, or similar method, and preferably cuts the N-terminal fragment into small pieces (less than 10,000 MW). The cleavage preferably results in the presence of no more than 6 collagen repeats, more preferably 3 collagen repeats, and most preferably no collagen repeats. A collagen repeat consists of GLY-X-Y. A determination of whether an active gOBG3 has been generated can be checked using the in vitro and in vivo assays described herein (Examples 4-6, 8-10).

Example 3

Generation of gOBG3 by Recombinant Methodology

Restriction Site Cloning

A first approach is to look for unique restriction sites near the beginning of the globular head region (nucleic acid sequences of mouse and human OBG3 polypeptides are provided in the sequence listing). If present, it can be used to cleave the 5′ collagen-like region from the globular head region. If a unique site is not present, it is also possible, although more difficult, to do this using restriction enzymes that cut in more than one location by doing partial digestions. The 3′ end of the globular head can be cut from its vector backbone using an appropriate enzyme. The globular head can then be cloned into an expression vector and constructs containing the correct fragments can be identified. For AdipoQ, Tau I seems to be a unique enzyme that would separate the collagen tail from the globular head.

PCR Cloning

Another approach is to PCR the region of interest from the intact sequence (if cDNA is available) using primers with restriction sites on the end so that PCR products can be directly cloned into vectors of interest. Alternatively, gOBG3 can also be generated using RT-PCR to isolate it from adipose tissue RNA.

E. coli Vector

For example, the AdipoQ globular region can be cloned into pTrcHisB, by putting a Bam HI site on the sense oligo and a Xho I site on the antisense oligo. This allows isolation of the PCR product, digestion of that product, and ligation into the pTrcHisB vector that has also been digested with Bam HI and Xho I (FIG. 4). The vector, pTrcHisB, has an N-terminal 6-Histidine tag, that allows purification of the over expressed protein from the lysate using a Nickel resin column. The pTrcHisB vector is used for over-expression of proteins in E. coli.

Exemplary oligos for cloning into the E. coli vector include:

A) OBG3 sense CTTAGTGGATCCCGCTTATGTGTATCGCTCAG 6 base pairs from the left there is a 6 bp BamHI site. Thus the region that is homologous to the gene begins at nucleotide 13.

B) OBG3 antisense GCTGTTCTCGAGTCAGTTGGTATCATGG 6 base pairs from the left there is a 6 bp. XhoI site. Thus the region that is homologous to the gene begins at nucleotide 13.

The following are exemplary PCR conditions.

Final concentrations in the reaction are:

1×PE Biosystems buffer A

1.5 mM MgCl₂

200 uM of each dNTP (dATP, dCTP, dGTP, dTTP)

2.5 Units of Amplitaq Gold from PE Biosystems

0.4 uM of each primer (sense and antisense)

10 ng of plasmid template

Cycling parameters:

95° C. 10 min 1 cycle

95° C. 30 sec

56° C. 30 sec

72° C. 30 sec

repeat above 3 steps for 30 cycles

72° C. 7 min 1 cycle.

BAC Vector

The globular head can also be over expressed in a Baculovirus system using the 6×His Baculovirus kit (Pharmingen), for example. The AdipoQ globular region is cloned into the appropriate vector using enzymes available in the multiple cloning site. This allows over-expression of the protein in a eukaryotic system which has some advantages over the E. coli system, including: Multiple gene expression, Signal peptide cleavage, Intron splicing, Nuclear transport, Functional protein, Phosphorylation, Glycosylation, and Acylation.

Exemplary oligos for cloning into the Baculovirus vector are the following:

A). OBG3 sense CTTAGTGAATTCGCTTATGTGTATCGCTCAGA 6 base pairs from the left there is a 6 bp. EcoRI site. Thus the region that is homologous to the gene begins at nucleotide 13.

B). OBG3 antisense GCTGTTCTGCAGTCAGTTGGTATCATGG 6 base pairs from the left there is a 6 bp. PstI site. Thus the region that is homologous to the gene begins at nucleotide 13.

The following are exemplary PCR conditions.

Final concentrations in the reaction are:

1×PE Biosystems buffer A

1.5 mM MgCl₂

200 uM of each dNTP (dATP, dCTP, dGTP, dTTP)

2.5 Units of Amplitaq Gold from PE Biosystems

0.4 uM of each primer (sense and antisense)

10 ng of plasmid template

Cycling Parameters:

95° C. 10 min 1 cycle

95° C. 30 sec

60° C. 30 sec

72° C. 30 sec

repeat above 3 steps for 30 cycles

72° C. 7 min −1 cycle.

Mammalian Vector

Globular OBG3 can also be cloned into a mammalian expression vector and expressed in and purified from mammalian cells, for example 3T3-L1 cells (undifferentiated adipocyte precursors). The globular head is then generated in an environment very close to its endogenous environment. However, this is not necessarily the most efficient way to make protein.

Example 4

In Vitro Tests of Obesity-Related Activity

The activity of various preparations and various sequence variants of gOBG3 polypeptide fragments are assessed using various in vitro assays including those provided below. These assays are also exemplary of those that can be used to develop gOBG3 polypeptide fragment antagonists and agonists. To do that, the effect of gOBG3 polypeptide fragments in the above assays, e.g. on leptin and/or LSR activity, in the presence of the candidate molecules would be compared with the effect of gOBG3 polypeptide fragments in the assays in the absence of the candidate molecules. Since gOBG3 polypeptide fragments have been shown to reduce body weight in mice on a high-cafeteria diet (Example 5), these assays also serve to identify candidate treatments for reducing (or increasing) body weight.

Liver Cell Line

Tests of efficacy of gOBG3 polypeptide fragments on LSR can be performed using liver cell lines, including for example, PLC, HepG2, Hep3B (human), Hepa 1-6, BPRCL (mouse), or MCA-RH777, MCA-RH8994 (rat). For human cell lines, apM1. and globular apM1 would be used preferentially; for rodents, full-length and globular AdipoQ/Acrp30 would be used preferentially.

BPRCL mouse liver cells (ATCC Repository) were plated at a density of 300,000 cells/well in 6-well plates (day 0) in DMEM (high glucose) containing glutamine and penicillin-streptomycin (Bihain & Yen, 1992). Media was changed on day 2. On day 3, the confluent monolayers were washed once with phosphate-buffered saline (PBS, pH 7.4) (2 mL/well). Cells were incubated at 37° C. for 30 min with increasing concentrations of recombinant AdipoQ (AQ) or globular AdipoQ (AQ-GH) in DMEM containing 0.2% (w/v) BSA, 5 mM Hepes, 2 mM CaCl₂, 3.7 g/L sodium bicarbonate, pH 7.5. Incubations were continued for 3 h at 37° C. after addition of 10 ng/mL ¹²⁵I-mouse leptin (specific activity, 22100 cpm/ng). Monolayers were washed 2 times consecutively with PBS containing 0.2% BSA, followed by 1 wash with PBS/BSA, and then 2 times consecutively with PBS. Cells were lysed with 0.1 N NaOH containing 0.24 mM EDTA. Lysates were collected into tubes, and counted in a gamma-counter. Results of an exemplary experiment are shown as the mean of triplicate determinations in FIG. 5.

The results indicate that gOBG3 polypeptide fragments are at least 30% more efficient than OBG3 in increasing leptin uptake in a liver cell line (FIG. 5). This assay could be used to determine the efficiency of gOBG3 polypeptide fragments and related compounds (or agonists or antagonists) to increase or decrease leptin uptake into the liver, as well as the mechanism by which the gOBG3 polypeptide fragment/compound exerts this effect.

Blood Brain Barrier Model

The effect of gOBG3 polypeptide fragments on leptin transport in the brain can be determined using brain-derived cells. One method that is envisioned is to use the blood/brain barrier model described by Dehouck, et al., “An easier, reproducible, and mass-production method to study the blood-brain barrier in vitro”, J Neurochem 54, 1798 -1801 (1990); hereby incorporated herein by reference in its entirety (including any figures, tables, or drawings) that uses a co-culture of brain capillary endothelial cells and astrocytes to test the effects of gOBG3 polypeptide fragments on leptin (or other molecules) transport via LSR or other receptors.

This assay would be an indicator of the potential effect of gOBG3 polypeptide fragments on leptin transport to the brain and could be used to screen gOBG3 polypeptide fragment variants for their ability to modulate leptin transport through LSR or other receptors in the brain. In addition, putative agonists and antagonists of the effect of gOBG3 polypeptide fragments on leptin transport through LSR or other receptors could also be screened using this assay. Increased transport of leptin across the blood/brain barrier would presumably increase its action as a satiety factor.

FACs Analysis of LSR Expression

The effect of gOBG3 polypeptide fragments on LSR can also be determined by measuring the level of LSR expression at the cell surface by flow surface cytometry, using anti-LSR antibodies and fluorescent secondary antibodies. Flow cytometry is a laser-based technology that is used to measure characteristics of biological particles. The underlying principle of flow cytometry is that light is scattered and fluorescence is emitted as light from the excitation source strikes the moving particles.

This is a high through-put assay that could be easily adapted to screen OBG3 and gOBG3 polypeptide fragments and variants as well as putative agonists or antagonists of gOBG3 polypeptide fragments. Two assays are provided below. The antibody, cell-line and gOBG3 polypeptide fragment analog would vary depending on the experiment, but a human cell-line, human anti-LSR antibody and globular apM1. could be used to screen for variants, agonists, and antagonists to be used to treat humans.

Assay 1:

Cells are pretreated with either intact OBG3 or gOBG3 polypeptide fragments (or untreated) before harvesting and analysis by FACS. Cells are harvested using non-enzymatic dissociation solution (Sigma), and then are incubated for 1 h at 4° C. with a 1:200 dilution of anti-LSR 81B or an irrelevant anti-serum in PBS containing 1% (w/v) BSA. After washing twice with the same buffer, goat anti-rabbit FITC-conjugated antibody (Rockland, Gilbertsville, Pa.) is added to the cells, followed by a further incubation for 30 min at 4° C. After washing, the cells are fixed in 2% formalin. Flow cytometry analysis is done on a FACSCalibur cytometer (Becton-Dickinson, Franklin Lakes, N.J.).

The in vitro Liver Cell Line assay (described above) has shown that LSR activity (leptin binding) increases with increasing concentrations of gOBG3 polypeptide fragments. Whle not wishing to be bound by any particular theory, this could either be the result of an increased number of LSR binding sites on the cell surface, or a change in affinity for leptin. The FACS assay would presumably be detecting changes in the number of LSR binding sites, although changes in conformation reflecting changes in affinity might also be detected. Preferably the antibody would be to the C-terminus of LSR.

Assay 2:

Cells are cultured in T175 flasks according to manufacturer's instructions for 48 hours prior to analysis.

Cells are washed once with FACs buffer (1×PBS/2% FBS, filter sterilized), and manually scraped from the flask in 10 mLs of FACs buffer. The cell suspension is transferred to a 15 mL conical tube and centrifuged at 1200 rpm, 4° C. for 5 minutes. Supernatant is discarded and cells are resuspended in 10 mL FACs buffer chilled to 4° C. A cell count is performed and the cell density adjusted with FACs buffer to a concentration of 1×10⁶ cells/mL. One milliliter of cell suspension was added to each well of a 48 well plate for analysis. Cells are centrifuged at 1200 rpm for 5 minutes at 4° C. Plates are checked to ensure that cells are pelleted, the supernatant is removed and cells resuspended by running plate over a vortex mixer. One milliliter of FACs buffer is added to each well, followed by centrifugation at 1200 rpm for 5 minutes at 4° C. This described cell washing was performed a total of 3 times.

Primary antibody, titered in screening experiments to determine proper working dilutions (for example 1:25, 1:50, 1:100, 1:200, 1:400, 1:500, 1:800, 1:1000, 1:2000, 1:4000, 1:5000, or 1:10000), is added to cells in a total volume of 50 μL FACs buffer. Plates are incubated for 1 h at 4° C. protected from light. Following incubation, cells are washed 3 times as directed above. Appropriate secondary antibody, titered in screening experiments to determine proper working dilutions (for example 1:25, 1:50, 1:100, 1:200, 1:400, 1:500, 1:800, 1:1000, 1:2000, 1:4000, 1:5000, or 1:10000), is added to cells in a total volume of 50 μL FACs buffer. Plates are incubated for 1 h at 4° C. protected from light. Following incubation, cells are washed 3 times as directed above. Upon final wash, cells are resuspended in 500 μL FACs buffer and transfered to a FACs acquisition tube. Samples are placed on ice protected from light and analyzde within 1 hour.

Cellular Binding and Uptake of gOBG-3 as Detected by Fluorescence Microscopy

Fluorecein isothiocyanate (FITC) conjugation of gOBG3: Purified gOBG3 at 1 mg/mL concentration was labeled with FITC using Sigma's FluoroTag FITC conjugation kit (Stock No. FITC-1). Protocol outlined in the Sigma Handbook for small scale conjugation was followed for gOBG3 labeling.

Cell Culture: C2C12 mouse skeletal muscle cells (ATCC, Manassas, Va. CRL-1772) and Hepa-1-6 mouse hepatocytes (ATCC, Manassas, Va. CRL-1830) were seeded into 6 well plates at a cell density of 2×10⁵ cells per well. C2C12 and Hepa-1-6 cells were cultured according to repository's instructions for 24-48 hours prior to analysis. Assay was performed when cells were 80% confluent.

FITC labeled gOBG3 cellular binding and uptake using microscopy: C2C12 and Hepa 1-6 cells were incubated in the presence/absence of antibody directed against human LSR (81B: N-terminal sequence of human LSR; does not cross react with mouse LSR and 93A: c-terminal sequence, cross reacts with mouse LSR) or an antiserum directed against gC1qr (953) for 1 hour at 37° C., 5% CO2. LSR antibodies were added to the media at a concentration of 2 μg/mL. The anti-gC1qr antiserum was added to the media at a volume of 2.5 μL undiluted serum (high concentration) or 1:100 dilution (low concentration). Following incubation with specified antibody, FITC-gOBG3 (50 nM/mL) was added to each cell culture well. Cells were again incubated for 1 hour at 37° C., 5% CO2. Cells were washed 2× with PBS, cells were scraped from well into 1 mL of PBS. Cell suspension was transferred to an eppendorf tube and centrifuged at 1000 rpm for 2 minutes. Supernatant was removed and cells resuspended in 200 μL of PBS. Binding and uptake of FITC-gOBG3 was analyzed by fluorescence microscopy under 40× magnification.

Analysis of C2C12 and Hepa 1-6 cells reveals identical phenotypes with respect to FITC-gOBG3 binding and uptake profiles both in the presence or absence of LSR antibodies. FITC-gOBG3 appears to be localized within vesicles in the cytoplasm of both mouse hepatocytes and mouse myoblasts, suggesting that binding and uptake of FITC-gOBG3 is occurring. FITC-gOBG3 uptake appears to be blocked when cells were pre-treated with the anti-LSR antibody that recognizes mouse LSR. However, binding of FITC-gOBG3 to the cell surface does occur in a small portion of the cells (C2C12 and Hepa 1-6). At low concentration of the gC1qr antiserum, FITC-gOBG3 appears to be localized within vesicles in the cytoplasm of both cell types, similarly to the phenotype of cells that have not received antibody pre-treatment prior to addition of FITC-gOBG3. FITC-gOBG3 uptake and binding phenotype is not affected by pre-treatment with an LSR antibody that does not recognize mouse LSR. Together, these data suggest that uptake of FITC-gOBG3 can be blocked by a human LSR antibody which cross-reacts with mouse LSR. However, this phenotype is not reproduced with other non cross-reactive LSR antibodies. Thus, this assay may be useful for identifying agents that facilitate or prevent the uptake and/or binding of OBG3 or gOBG3 polypeptide fragments to cells.

Effect on LSR as a Lipoprotein Receptor

The effect of gOBG3 on the lipoprotein binding, internalizing and degrading activity of LSR can also be tested. Measurement of LSR as lipoprotein receptor is described in Bihain & Yen, ((1992) Biochemistry May 19;31(19):4628-36; hereby incorporated herein in its entirety including any drawings, tables, or figures). The effect of gOBG3 on the lipoprotein binding, internalizing and degrading activity of LSR (or other receptors) can be compared with that of intact OBG3, with untreated cells as an additional control. This assay can also be used to screen for active and inhibitory variants of gOBG3, as well as agonists and antagonists of obesity-related activity.

Human liver PLC cells (ATCC Repository) were plated at a density of 300,000 cells/well in 6-well plates (day 0) in DMEM (high glucose) containing glutamine and penicillin-streptomycin (Bihain & Yen, 1992). Media was changed on day 2. On day 3, the confluent monolayers were washed once with phosphate-buffered saline (PBS, pH 7.4) (2 mL/well). Cells were incubated at 37° C. for 30 min with 10 ng/mL human recombinant leptin in DMEM containing 0.2% (w/v) BSA, 5 mM Hepes, 2 mM CaCl₂, 3.7 g/L sodium bicarbonate, pH 7.5, followed by another 30 min incubation at 37° C. with increasing concentrations of gOBG3. Incubations were continued for 2 h at 37° C. after addition of 0.8 mM oleate and 20 μg/mL ¹²⁵I-LDL. Monolayers were washed 2 times consecutively with PBS containing 0.2% BSA, followed by I wash with PBS/BSA, and then 2 times consecutively with PBS. The amounts of oleate-induced binding, uptake and degradation of ¹²⁵I-LDL were measured as previously described (Bihain & Yen, 1992, supra). Results are shown as the mean of triplicate determinations.

As shown in FIG. 6, the addition of gOBG3 leads to an increased activity of LSR as a lipoprotein receptor. The oleate-induced binding and uptake of LDL appears more affected by gOBG3 as compared to the degradation. This increased LSR activity would potentially result in an enhanced clearance of triglyceride-rich lipoproteins during the postprandial state. Thus, more dietary fat would be removed through the liver, rather than being deposited in the adipose tissue.

This assay could be used to determine the efficiency of a compound (or agonists or antagonists) to increase or decrease LSR activity (or lipoprotein uptake, binding and degradation through other receptors), and thus affect the rate of clearance of triglyceride-rich lipoproteins.

Effect on Muscle Differentiation

C2C12 cells (murine skeletal muscle cell line; ATCC CRL 1772, Rockville, Md.) are seeded sparsely (about 15-20%) in complete DMEM (w/glutamine, pen/strep, etc)+10% FCS. Two days later they become 80-90% confluent. At this time, the media is changed to DMEM+2% horse serum to allow differentiation. The media is changed daily. Abundant myotube formation occurs after 3-4 days of being in 2% horse serum, although the exact time course of C2C12 differentiation depends on how long they have been passaged and how they have been maintained, among other things.

To test the effect of the presence of gAcrp30 on muscle differentiation, gAcrp30 (1 to 2.5 μg/mL) was added the day after seeding when the cells were still in DMEM w/10% FCS. Two days after plating the cells (one day after gAcrp30 was first added), at about 80-90% confluency, the media was changed to DMEM+2% horse serum plus gAcrp30.

The results show that the addition of gAcrp30 causes the cells to begin organizing within one day after its addition. In contrast to the random orientation of the cells not treated with gAcrp30, those treated with gAcrp30 aligned themselves in relation to each other. In addition, differentiation occurred after only 2 days of gAcrp30 treatment, in contrast to the 3 to 4 days needed in its absence.

Effect on Muscle Cell Fatty Acid Oxidation

C2C12 cells were differentiated in the presence or absence of 2 μg/mL gAcrp30 for 4 days. On day 4, oleate oxidation rates were determined by measuring conversion of 1¹⁴C-oleate (0.2 mM) to ¹⁴CO₂ for 90 min. C2C12 cells differentiated in the presence of gAcrp30 undergo 40% more oleate oxidation than controls differentiated in the absence of gAcrp30. This experiment can be used to screen for active fragments and peptides as well as agonists and antagonists or activators and inhibitors of OBG3 and gOBG3 polypeptides.

The effect of gAcrp30 on the rate of oleate oxidation was compared in differentiated C2C12 cells (murine skeletal muscle cells; ATCC, Manassas, Va. CRL-1772) and in a hepatocyte cell line (Hepa1-6; ATCC, Manassas, Va. CRL-1830). Cultured cells were maintained according to manufacturer's instructions. The oleate oxidation assay was performed as previously described (Muoio et al. (1999) Biochem J 338;783-791). Briefly, nearly confluent myocytes were kept in low serum differentiation media (DMEM, 2.5% Horse serum) for 4 days, at which time formation of myotubes became maximal. Hepatocytes were kept in the same DMEM medium supplemented with 10% FCS for 2 days. One hour prior to the experiment the media was removed and 1 mL of preincubation media (MEM, 2.5% Horse serum, 3 mM glucose, 4 mM Glutamine, 25 mM Hepes, 1% FFA free BSA, 0.25 mM Oleate, 5 μg/mL gentamycin) was added. At the start of the oxidation experiment ¹⁴C-Oleic acid (1 μCi/mL, American Radiolabeled Chemical Inc., St. Louis, Mo.) was added and cells were incubated for 90 min at 37° C. in the absence/presence of 2.5 μg/mL gAcrp30. After the incubation period 0.75 mL of the media was removed and assayed for ¹⁴C-oxidation products as described below for the muscle FFA oxidation experiment.

Oleate oxidation in C2C12 cells determined over 90 min increased significantly (39%; p=0.036, two-tailed t-Test) in cells treated with gAcrp30. In contrast, no detectable increase in the rate of FFA oxidation was seen in hepatocytes incubated with gAcrp30.

Triglyceride and Protein Analysis Following Oleate Oxidaiton in Cultured Cells

Following transfer of media for oleate oxidation assay, cells were placed on ice. To determine triglyceride and protein content, cells were washed with 1 mL of 1×PBS to remove residual media. To each well 300 μL of cell dissociation solution (Sigma) was added and incubated at 37° C. for 10 min. Plates were tapped to loosen cells, and 0.5 mL of 1×PBS was added. The cell suspension was transferred to an eppendorf tube, each well was rinsed with an additional 0.5 mL of 1×PBS, and was transferred to appropriate eppendorf tube. Samples were centrifuged at 1000 rpm for 10 minutes at room temperature. Supernatant was discarded and 750 μL of 1×PBS/2% chaps was added to cell pellet. Cell suspension was vortexed and place on ice for 1 hour. Samples were then centrifuged at 13000 rpm for 20 min at 4° C. Supernatants were transferred to new tube and frozen at −20° C. until analyzed. Quantitative measure of triglyceride level in each sample was determined using Sigma Diagnostics GPO-TRINDER enzymatic kit. The procedure outlined in the manual was adhered to, with the following exceptions: assay was performed in 48 well plate, 350 μL of sample volume was assayed, control blank consisted of 350 μL PBS/2% chaps, and standard contained 10 μL standard provide in kit plus 690 μL PBS/2% chaps. Analysis of samples was carried out on a Packard Spectra Count at a wavelength of 550 nm. Protein analysis was carried out on 25 μL of each supernatant sample using the BCA protein assay (Pierce) following manufacturer's instructions. Analysis of samples was carried out on a Packard Spectra Count at a wavelength of 550 nm.

Triglyceride production in both C2C12 and Hepa 1-6 cells did not change significantly in the absence/presence of Acrp30 and gAcrp30. The protein content of all cells analyzed was equivalent in the absence/presence of Acrp30 and gAcrp30.

Example 5

Effect of gOBG3 on Mice Fed a High-Fat Diet

Experiments are performed using approximately 6 week old C57B1/6 mice (8 per group). All mice are housed individually. The mice are maintained on a high fat diet throughout each experiment. The high fat diet (cafeteria diet; D12331 from Research Diets, Inc.) has the following composition: protein kca1% 16, sucrose kca1% 26, and fat kca1% 58. The fat was primarily composed of coconut oil, hydrogenated.

After the mice are fed a high fat diet for 6 days, micro-osmotic pumps are inserted using isoflurane anesthesia, and are used to provide gOBG3, OBG3, saline, and an irrelevant peptide to the mice subcutaneously (s.c.) for 18 days. gOBG3 is provided at doses of 50, 25, and 2.5 μg/day; OBG3 is provided at 100, 50, and 5 μg/day; and the irrelevant peptide is provided at 10 μg/day. Body weight is measured on the first, third and fifth day of the high fat diet, and then daily after the start of treatment. Final blood samples are taken by cardiac puncture and are used to determine triglyceride (TG), total cholesterol (TC), glucose, leptin, and insulin levels. The amount of food consumed per day is also determined for each group.

In a preliminary experiment, mice treated with 2.5 μg/day gOBG3 had significantly lowered body weight.

Example 6

Tests of Obesity-Related Activity in Humans

Tests of the efficacy of gOBG3 in humans are performed in accordance with a physician's recommendations and with established guidelines. The parameters tested in mice are also tested in humans (e.g. food intake, weight, TG, TC, glucose, insulin, leptin, FFA). It is expected that the physiological factors would show changes over the short term. Changes in weight gain might require a longer period of time. In addition, the diet would need to be carefully monitored. Globular OBG3 would be given in daily doses of about 6 mg protein per 70 kg person or about 10 mg per day. Other doses would also be tested, for instance 1 mg or 5 mg per day up to 20 mg, 50 mg, or 100 mg per day.

Example 7

Tests of Obesity-related Activity in a Murine Lipoatrophic Diabetes Model

Previously, leptin was reported to reverse insulin resistance and diabetes mellitus in mice with congenital lipodystrophy (Shimomura et al. Nature 401: 73-76 (1999); hereby incorporated herein in its entirety including any drawings, figures, or tables). Leptin was found to be less effective in a different lipodystrophic mouse model of lipoatrophic diabetes (Gavrilova et al. Nature 403: 850 (2000); hereby incorporated herein in its entirety including any drawings, figures, or tables). The instant invention encompasses the use of OBG3 or gOBG3 polypeptide fragments for reducing the insulin resistance and hyperglycaemia in this model either alone or in combination with leptin, the leptin peptide (U.S. provisional application No. 60/155,506), or other compounds. Assays include that described previously in Gavrilova et al. ((2000) Diabetes November;49(11):1910-6; (2000) Nature February 24;403(6772):850) using A-ZIP/F-1 mice, except that gOBG3 would be administered using the methods previously described in Example 5 (or Examples 8-10). The glucose and insulin levels of the mice would be tested, and the food intake and liver weight monitored, as well as other factors, such as leptin, FFA, and TG levels, typically measured in our experiments (see Example 5, above, or Examples 8-10).

Example 8

Effect of gOBG-3 on plasma Free Fatty Acid in C57 BL/6 Mice

The effect of the globular head of Acrp30 on postprandial lipemia (PPL) in normal C57BL6/J mice was tested. Acrp30 is another name for adipo Q and is the mouse protein homologue to the human apM1 protein. OBG3 is a generic way to refer to all of these forms. The globular head form is indicated by placing a ‘g’ in front, e.g. g-acrp30 or gOBG3. The gOBG3 used was prepared by proteolytic digestion of recombinant OBG3 as described previously in Example 2. Acetylated trypsin was used as protease.

The mice used in this experiment were fasted for 2 hours prior to the experiment after which a baseline blood sample was taken. All blood samples were taken from the tail using EDTA coated capillary tubes (50 μL each time point). At time 0 (8:30 AM), a standard high fat meal (6 g butter, 6 g sunflower oil, 10 g nonfat dry milk, 10 g sucrose, 12 mL distilled water prepared fresh following Nb#6, JF, pg.1) was given by gavage (vol.=1% of body weight) to all animals.

Immediately following the high fat meal, 25 μg gOBG3 was injected i.p. in 100 μL saline. The same dose (25 μg/mL in 100 μL) was again injected at 45 min and at 1 hr 45 min (treated group, n=8). Control animals (n=8) were injected with saline (3×100 μL). Untreated and treated animals were handled in an alternating mode.

Blood samples were taken in hourly intervals, and were immediately put on ice. Plasma was prepared by centrifugation following each time point. Plasma was kept at −20° C. and free fatty acids (FFA), triglycerides (TG) and glucose were determined within 24 hours using standard test kits (Sigma and Wako). Due to the limited amount of plasma available, glucose was determined in duplicate using pooled samples. For each time point, equal volumes of plasma from all 8 animals per treatment group were pooled. Error bars shown for glucose therefore represent the SD of the duplicate determination and not the variation between animals as for TG and FFA.

Results

The increase in plasma FFA due to the high fat meal was significantly lower in mice treated with gOBG3 at all time points between 1 and 4 hr. This can be interpreted as increase in FFA oxidation (FIG. 8).

Treatment with gOBG3 also led to a significantly smaller increase in plasma TG compared to untreated mice. However, this effect was less pronounced than the effect on FFA (FIG. 9).

Glucose turnover was significantly improved following treatment with gOBG3; this effect can be interpreted as improved insulin sensitivity possibly due to the decrease in FFA (FIG. 10).

Similar results were seen previously in a prior experiment involving only 2 treatments (at 0 and at 45 minutes; data not shown). A strong FFA lowering effect of gOBG3 coupled with a less dominant TG lowering effect was observed.

Example 9

Effect of gOBG-3 on Plasma Leptin and Insulin in C57 BL/6 Mice

The effect of the globular head of Acrp30 on plasma leptin and insulin levels during postprandial lipemia (PPL) in normal C57BL6/J mice was tested. The experimental procedure was the same as that described in Example 8, except that blood was drawn only at 0, 2 and 4 hours to allow for greater blood samples needed for the determination of leptin and insulin by RIA.

Briefly, 16 mice were fasted for 2 hours prior to the experiment after which a baseline blood sample was taken. All blood samples were taken from the tail using EDTA coated capillary tubes (100 μL each time point). At time 0 (9:00 AM), a standard high fat meal (see Example 8) was given by gavage (vol.=1% of body weight) to all animals. Immediately following the high fat meal, 25 μg gOBG3 was injected i.p. in 100 μL saline. The same dose (25 μg in 100 μL) was again injected at 45 min and at 1 hr 45 min (treated group, n=8). Control animals (n=8) were injected with saline (3×100 μL). Untreated and treated animals were handled in an alternating mode.

Blood samples were immediately put on ice and plasma was prepared by centrifugation following each time point. Plasma was kept at −20° C. and free fatty acids (FFA) were determined within 24 hours using a standard test kit (Wako). Leptin and Insulin were determined by RIA (ML-82K and SRI-13K, LINCO Research, Inc., St. Charles, Mo.) following the manufacturer's protocol. However, only 20 μL plasma was used. Each determination was done in duplicate. Due to the limited amount of plasma available, leptin and insulin were determined in 4 pools of 2 animals each in both treatment groups.

Results

As shown previously (Example 8), treatment with gOBG3 significantly reduced the postprandial increase in plasma FFA caused by the high fat meal at 2 hours (FIG. 11). There was no significant change in plasma leptin levels at any time point; treatment with gOBG3 did not affect leptin levels (FIG. 12). Insulin levels (FIG. 13) indicate a marginal increase in insulin at 2 hours. However, when analyzed as percentage change from to, this increase (212% vs. 260%, control vs. treated) was statistically not significant (p=0.09).

These data reconfirm the previously shown acceleration of FFA metabolism by treatment with gOBG3. They also show that gOBG3 does not affect leptin and insulin plasma levels and that gOBG3 reduces hyperglycemia during postprandial lipemia and also induces weight loss during treatment over several days. Without being limited by any particular theory, the data suggests: a) that the reduction in weight is caused by a leptin independent increase in metabolism; and b) that gOBG3 leads to increased insulin sensitivity.

Example 10

Effect of OBG-3 on Plasma FFA, TG and Glucose in C57 BL/6 Mice

The effect of the globular head of acrp30 on plasma FFA, TG, glucose, leptin and insulin levels during postprandial lipemia (PPL) in normal C57BL6/J mice has been described. Weight loss resulting from gOBG3 (2.5 μg/day) given to normal C57BL6/J mice on a high fat diet has also been shown (Example 5). In comparison, a much higher dose of the complete form of acrp30 (200 μg/day) was needed to induce a relatively smaller effect in mice. This example shows the effect of the acrp30-complete form on plasma FFA, TG and glucose levels.

The experimental procedure was similar to that described in Example 8. Briefly, 14 mice were fasted for 2 hours prior to the experiment after which a baseline blood sample was taken. All blood samples were taken from the tail using EDTA coated capillary tubes (50 μL each time point). At time 0 (9:00 AM), a standard high fat meal (see Example 8) was given by gavage (vol.=1% of body weight) to all animals. Immediately following the high fat meal, 4 mice were injected 25 μg OBG3 i.p. in 100 μL saline. The same dose (25 μg in 100 μL) was again injected at 45 min and at 1 hr 45 min. A second treatment group (n=4) received 3 times 50 μg OBG3 at the same intervals. Control animals (n=6) were injected with saline (3×100 μL). Untreated and treated animals were handled in an alternating mode.

Blood samples were immediately put on ice. Plasma was prepared by centrifugation following each time point. Plasma was kept at −20° C. and free fatty acids (FFA), triglycerides (TG) and glucose were determined within 24 hours using standard test kits (Sigma and Wako).

Results

Treatment with full length OBG3 had no effect on plasma FFA levels (FIG. 14) except for t=2 hours when a statistically significant reduction was shown (p<0.05). No significant change in postprandial TG (FIG. 15) and glucose levels (FIG. 16) was seen in treated animals.

The data presented show that the complete form of OBG3 did not reduce FFA, TG and glucose levels in contrast to what was observed for the globular region (Examples 5, 8, 9). Only at 2 hours post-gavage, did treatment with OBG3 reduce FFA plasma concentrations significantly (p<0.05). These results demonstrate that gOBG3 is much more active in vivo than the full length protein. A similar effect was seen for body weight reduction; the globular head was much more active than the full-length protein.

Example 11

Effect of gAcrp30 on FFA Following Epinephrine Injection

In mice, plasma free fatty acids increase after intragastric administration of a high fat/sucrose test meal. These free fatty acids are mostly produced by the activity of lipolytic enzymes i.e. lipoprotein lipase (LPL) and hepatic lipase (HL). In this species, these enzymes are found in significant amounts both bound to endothelium and freely circulating in plasma¹⁶. Another source of plasma free fatty acids is hormone sensitive lipase (HSL) that releases free fatty acids from adipose tissue after β-adrenergic stimulation. To test whether gAcrp30 also regulates the metabolism of free fatty acid released by HSL, mice were injected with epinephrine.

Two groups of mice (n=5 each) were given epinephrine (5 μg) by intraperitoneal injection. A treated group was injected with gAcrp30 (25 μg) one hour before and again together with epinephrine, while control animals received saline. Plasma was isolated and free fatty acids and glucose were measured as described above (Example 10). As shown in FIG. 18, epinephrine injections (5 μg) caused an increase in plasma free fatty acids and glucose. Both effects were significantly reduced in gAcrp30-treated mice.

This reduction in the increases of glucose and FFA levels was not due to blockage of the β-adrenergic effect of epinephrine, as shown by inducing the release of FFA from isolated adipose tissue in vitro. In these control studies, adipose tissue was removed from normal C57BL/6J mice and incubated in Krebs-Henseleit bicarbonate buffer. Epinephrine was added and the concentration of FFA in the medium following a 90 min incubation was determined. Epinephrine (10 μM) caused a 1.7-fold increase in free fatty acids in the media. Increasing concentrations of gAcrp30 or Acrp30 up to 50 μg/ml did not inhibit this effect of epinephrine.

The data presented thus far indicate that the globular region of Acrp30 exerts profound pharmacological effects on the metabolism of energy substrates with the most evident effect on plasma free fatty acids. Further, the reduction in plasma FFA concentration cannot be explained by inhibition of either LPL—this would cause an increase in plasma triglycerides while a decrease of plasma triglycerides is actually observed—or by inhibition of HSL. Thus, the simplest explanation is that gAcrp30 causes increased removal of free fatty acids from the circulation by promoting cellular uptake.

Example 12

Effect of gAcrp30 on Muscle FFA Oxidation

To investigate the effect of gAcrp30 on muscle free fatty acid oxidation, intact hind limb muscles from C57BL/6J mice were isolated and FFA oxidation was measured using oleate as substrate [Clee, et al., “Plasma and vessel wall lipoprotein lipase have different roles in atherosclerosis”, J Lipid Res 41, 521-531 (2000); Muoio, et al., “Leptin opposes insulin's effects on fatty acid partitioning in muscles isolated from obese ob/ob mice”, Am J Physiol 276, E913-921 (1999). Oleate oxidation in isolated muscle was measured as previously described [Cuendet, et al., “Decreased basal, noninsulin-stimulated glucose uptake and metabolism by skeletal soleus muscle isolated from obese-hyperglycemic (oblob) mice”, J Clin Invest 58, 1078-1088 (1976); Le Marchand-Brustel, et al., “Insulin binding and effects in isolated soleus muscle of lean and obese mice”, Am J Physiol 234, E348-E358 (1978)]. Briefly, mice were sacrificed by cervical dislocation and soleus and EDL muscles were rapidly isolated from the hind limbs. The distal tendon of each muscle was tied to a piece of suture to facilitate transfer among different media. All incubations were carried out at 30° C. in 1.5 mL of Krebs-Henseleit bicarbonate buffer (118.6 mM NaCl, 4.76 mM KCl, 1.19 mM KH₂PO₄, 1.19 mM MgSO₄, 2.54 mM CaCl₂, 25 mM NaHCO₃, 10 mM Hepes, pH 7.4) supplemented with 4% FFA free bovine serum albumin (fraction V, RIA grade, Sigma) and 5 mM glucose (Sigma). The total concentration of oleate (Sigma) throughout the experiment was 0.25 mM. All media were oxygenated (95% O₂; 5% CO₂) prior to incubation. The gas mixture was hydrated throughout the experiment by bubbling through a gas washer (Kontes Inc., Vineland, N.J.).

Muscles were rinsed for 30 min in incubation media with oxygenation. The muscles were then transferred to fresh media (1.5 mL) and incubated at 30° C. in the presence of 1 μCi/mL [1-¹⁴C] oleic acid (American Radiolabeled Chemicals). The incubation vials containing this media were sealed with a rubber septum from which a center well carrying a piece of Whatman paper (1.5 cm×11.5 cm) was suspended.

After an initial incubation period of 10 min with constant oxygenation, gas circulation was removed to close the system to the outside environment and the muscles were incubated for 90 min at 30° C. At the end of this period, 0.45 mL of Solvable (Packard Instruments, Meriden, Conn.) was injected onto the Whatman paper in the center well and oleate oxidation by the muscle was stopped by transferring the vial onto ice.

After 5 min, the muscle was removed from the medium, and an aliquot of 0.5 mL medium was also removed. The vials were closed again and 1 mL of 35% perchloric acid was injected with a syringe into the media by piercing through the rubber septum. The Colo.₂ released from the acidified media was collected by the Solvable in the center well. After a 90 min collection period at 30° C., the Whatman paper was removed from the center well and placed in scintillation vials containing 15 mL of scintillation fluid (HionicFlour, Packard Instruments, Meriden, Conn.). The amount of ¹⁴C radioactivity was quantitated by liquid scintillation counting. The rate of oleate oxidation was expressed as nmol oleate produced in 90 min/g muscle.

To test the effect of gAcrp30 or Acrp30 on oleate oxidation, these proteins were added to the media at a final concentration of 2.5 μg/mL and maintained in the media throughout the procedure.

Two muscles of different oxidative capacity (soleus and extensor digitorum longus (EDL)) were tested (FIG. 19). EDL and Soleus muscles were isolated from both legs of normal C57BL/6J mice (n=18). One muscle of each pair was incubated in medium with 2.5 μg/mL gAcrp30 (dark gray) and one in medium without gAcrp30 (control—light gray). This experimental design allowed us to compare oleate oxidation in pairs of muscles isolated from the same animal. ¹⁴C-Oleate oxidation was determined over 90 minutes. Incubation of EDL and soleus muscles for 90 minutes in medium containing 2.5 μg/ml gAcrp30 leads to a statistically significant increase in oleate oxidation (p<0.05, paired, one-tailed, t-Test) or (p=0.0041, Repeated Measures Analysis of Variance, Univariate Tests of Hypotheses for Within Subject Effects) in both muscle types.

Both muscle types showed a significant response to gAcrp30. The relative increase in FFA oxidation was 17% (p=0.03) and 10% (p=0.04) for EDL and soleus, respectively. In humans, muscles represent approximately 25% of body weight. Therefore, even a moderate increase in free fatty acid oxidation can have quantitatively important consequences on overall energy utilization.

Example 13

Effect of gArcp30 on Triglyceride in Muscle & Liver Isolated from Mice

To determine whether the increased FFA oxidation induced by gAcrp30 is also accompanied by increased FFA delivery into muscle or liver, the hindlimb muscle and liver triglyceride content was measured after gAcrp30 treatment of mice. Hind limb muscles as well as liver samples were removed from treated and untreated animals and the triglyceride and free fatty acid concentration was determined following a standard lipid extraction method Shimabukuro, et al., “Direct antidiabetic effect of leptin through triglyceride depletion of tissues”, Proc Natl Acad Sci U S A 94, 4637-4641 (1997) followed by TG and FFA analysis using standard test kits.

Short-term treatment of animals with gAcrp30 (2 injections of 25 μg each given within 3 hours before sacrifice) did not change the triglyceride content either of hind limb muscle or liver tissue (data not shown). However, after 3 days of treatment, during which period normal C57BL/6J mice consumed a regular rodent diet, mice that had received 25 μg of gAcrp30 twice daily showed significantly higher (p=0.002) muscle triglyceride content (FIG. 20A) than those receiving saline (control: light gray; gAcrp30: dark gray). This contrasted with a lack of increase in liver triglycerides (FIG. 20B). Furthermore, no detectable increase in muscle TG was observed after the 16-day treatment shown independently by directly measuring the muscle TG content and by oil red O staining of frozen microscope sections. In summary, the data indicate that the increase in TG content was transient.

These data are consistent with the notion that gAcrp30 increases the rate of removal of free fatty acids from plasma at least partly by increasing their delivery to the muscle; much of the FFAs are immediately oxidized while some are stored as triglycerides and subsequently oxidized. Further support for this interpretation was obtained by measuring the concentration of ketone bodies in plasma of treated and untreated animals following a high fat/sucrose meal.

Ketone bodies (KB) are produced in the liver as a result of free fatty acid oxidation, but KB formation does not occur significantly in muscle. In mice receiving the high fat test meal and saline injection, the level of plasma KB increased significantly over the next 3 hours (183±12%, n=6). Animals treated with gAcrp30, on the other hand, showed no increase in plasma KB concentrations. Thus, gAcrp30 inhibits either directly KB formation or can decrease KB production by inhibiting liver FFA oxidation.

Example 14

Effect of gAcrp30 on Weight Gain & Weight Loss of Mice

Two independent studies showed that gAcrp30 also affects overall energy homeostasis. In the first, 10-week-old male C57BL/6J mice were put on a very high fat/sucrose purified diet for 19 days to promote weight gain (see Example 5); the average body weight at this time was 30 g. The mice were then surgically implanted with an osmotic pump (Alzet, Newark, Del.) delivering either 2.5 μg/day of gAcrp30, 5 μg/day of Acrp30, or physiological saline. The mice were continued on the high fat diet and their body weight was recorded over the following 10-day period.

Mice treated with saline or 5 μg/day of full length Acrp30 continued to gain weight at an average daily rate of 0.16% and 0.22%, respectively. In contrast, mice treated with gAcrp30 experienced a significant weight reduction (−3.7%, p=0.002) during the first 4 days and then their weight remained constant (FIG. 21A). Thus, in this inbred strain of normal mice, a continuous infusion of a daily low dose of gAcrp30 can prevent weight gain caused by high fat/sucrose feeding, in a sustainable way.

This result was confirmed and extended in a second study performed in mature 9 month old, male obese C57BL/6J mice that had been on the same high fat/sucrose diet for 6 months; the average body weight when the study began was 52.5±0.8 g. Three groups of 8 mice were treated with saline, Acrp30 or gAcrp30 for 16 days. Animals in the treated group received twice daily 25 μg of protein subcutaneously. Body weights were recorded at the indicated time points.

Treatment with gAcrp30 led to significant (p<0.05) weight loss at day 3. This effect became even more significant as the study continued. During the 16 day study period, the obese C57BL/6J mice that received gAcrp30 lost about 8% (p=0.001) of their initial body weight despite the fact that they were maintained on a high fat/sucrose diet (FIG. 21B). Saline treated animals showed only marginal fluctuations in their body weight (p=n.s.). Animals treated with the full length Acrp30, but at a 10-fold higher dose than that used in the first experiment, also lost significant weight (−3.2%, p=0.025). Interestingly, mice treated with gAcrp30 continued to lose weight at a steady rate during the 16-day study period, while the rate of weight reduction in those treated with the full length Acrp30 decreased during the later phase of the study. Food consumption in gAcrp30 treated animals was not significantly different from saline or Acrp30 treated animals (FIG. 21D).

Treatment with gAcrp30 caused a significant reduction in the concentration of plasma free fatty acids (FIG. 21C). This effect was significant after 3 days of treatment (p<0.05 vs. saline) and continued throughout the complete study period. Shown is the plasma FFA level at day 16 of the study. The initial FFA plasma concentration was the same in all three treatment groups. It should be noted, however, that despite this reduction the plasma free fatty acid concentration of these massively obese animals remains about 40-60% higher than that of normal mice. A blood chemistry analysis (including determination of SGPT, SGOT, urea, creatinine or bilirubin) performed on the terminal blood samples did not reveal any abnormal plasma parameters (FIG. 22).

Data are expressed throughout as mean±SEM; a p-value<0.05 was considered statistically significant. Statistical analysis was typically done using either the unpaired Student's t test or the paired Student's t test, as indicated in each study.

Example 15

Detection of apM1. (gOBG3) Fragment in Human Plasma After Immunoprecipitation

The recombinant form of Acrp30 protein used has an apparent molecular weight of 37 kDa and forms a dimer of 74 kDa (FIG. 23A, Lane II). A proteolytic fragment that contains the entire globular head region (gAcrp30) and that migrates with an apparent molecular weight of 18 kDa was generated using acetylated trypsin (FIG. 23A, lane I). Both protein preparations (Acrp30 and gAcrp30) were essentially endotoxin free; ActiClean Etox affinity columns (Sterogene Bioseparations Inc., Carlsbad, Calif.) were used to remove potential endotoxin contaminations following the manufacturer's protocol. Endotoxin levels were determined by Endosafe, Charleston, S.C. As determined by N-terminal sequencing of purified gAcrp30, the site of cleavage was just before amino acid 104 (just before amino acid 101 for human gOBG3 or apM1).

Immunoprecipitation of human plasma Apm1 followed by Western blotting was used to detect a cleavage product of apM1, the human homolog of Acrp30, using a globular head specific anti-serum for the immunoprecipitation step as well as for the detection step. Preimmune serum or serum raised against the globular head domain or human non-homologous region (HDQETTTQGPGVLLPLPKGA) were cross-linked to protein A (Sigma Chemical CO, Saint Louis, Mo.) using dimethyl-pimelimidate-dihydrochloride (Sigma Chemical Co, Saint Louis, Mo.). After washing (0.2 M salt) proteins were eluted from protein A, separated by SDS-PAGE, transferred to Protran® pure nitrocellulose membrane (Schleicher and Schuell, Keene, N. H.) using standard procedures. apM1. products were visualized using globular head domain antibodies labeled with biotin; horseradish peroxidase conjugated to Streptavidin and CN/DAB substrate kit (Pierce, Rockford, Ill.) according to manufacturer's instructions.

The apparent molecular weight of this truncated form was 27 kDa, corresponding to about 70% of the complete form of apM1. (FIG. 23B, Lane IV). This truncated form was not detectable when immunoprecipitation was performed using a different antibody directed against the human non-homologous region (HDQETTTQGPGVLLPLPKGA) of apM1; this domain is located toward the NH₂ terminal end of the protein outside of the globular domain (FIG. 23, Lane V). Both anti-apM1. antibodies directed against either the globular or the non-globular domain identified the full-length form of the protein, as well as a low abundance dimer of apparent MW 74 kDa.

Example 16

Effect of gAcrp30 on FFA Following Intralipid Injection

Two groups of mice (n=5 each) were intravenously (tail vein) injected with 30 μL bolus of Intralipid-20% (Clintec) to generate a sudden rise in plasma FFAs, thus by-passing intestinal absorption. (Intralipid is an intravenous fat emulsion used in nutritional therapy). A treated group (□ gAcrp30-treated) was injected with gAcrp30 (25 μg) at 30 and 60 minutes before Intralipid was given, while control animals (□ control) received saline. Plasma was isolated and FFAs were measured as described previously.

The effect of gAcrp30 on the decay in plasma FFAs following the peak induced by Intralipid injection was then monitored. As shown in FIG. 24, gAcrp30 accelerates the removal of FFAs from plasma after Intralipid injection. Thus, gAcrp30 accelerates the clearance of FFAs without interfering with intestinal absorption. Although not wishing to be bound by any theory, because Intralipid does not elicit a significant insulin response, the results also indicate that gAcrp30 regulation of FFA metabolism occurs independently of insulin.

Example 17

Solubilization of OBG3 and Fragments Thereof

Vector Construction

Polynucleotides encoding for polypeptides comprising amino acid residues 101-244, 105-244, or 110-244 of apM1. (SEQ ID NO:6) or amino acid residues 104-247 of Acrp30 (SEQ ID NO:4) were cloned into bacterial expression vector pTrcHis. Polynucleotides encoding for polypeptides comprising amino acid residues 101-244, 110-244 or 19-244 (full length) of apM1. (SEQ ID NO:6) were cloned into a modified bacterial expression vector pET30a (containing His Tag at C-terminal of the apM1. polypeptides). Polynucleotides encoding for polypeptides comprising amino acid residues 101-244, 105-244, 110-244, or 19-244 of apM1 (SEQ ID NO:6) or amino acid residues 104-247 of Acrp30 (SEQ ID NO:4) were cloned into Baculoviral expression vector FastBacHT. Polynucleotides encoding for polypeptides comprising amino acid residues 101-244, 105-244, or 110-244 of apM1. (SEQ ID NO:6) were cloned into mammalian expression vector pcDNA4HisMax. Polynucleotides encoding for polypeptides comprising amino acid residues 1-244 or 1-19 and 101-244 of apM1 (SEQ ID NO:6) were cloned into mammalian expression vector pcDNA3.1Hygro. Alternatively, polynucleotides encoding for polypeptides comprising amino acid residues 1-244, 19-244, 101-244, 105-244, 110 -244 or any fragment of apM1. (SEQ ID NO:6), or amino acid residues 104-247 or any fragment of Acrp30 (SEQ ID NO:4) are cloned into any bacterial, mammalian, or baculoviral expression vector, preferably selected from pTrcHis, pET30a, FastBacHT, pcDNA4His, or pcDNA3.1Hygro. Alternately, polynucleotides encoding for polypeptides comprising amino acid residues 19-244, 101-244, 105-244, 110-244 or any fragment of apM1. (SEQ ID NO:6), or amino acid residues 104-247 or any fragment of Acrp30 (SEQ ID NO:4) are cloned into Pichia Pastoris (yeast) expression vector, preferably PHIL-S 1.

Cell Growth and Induction of OBG3 Expression

DH5alpha E. coli host cells were transformed with a pTrcHis expression vector containing polynucleotides of SEQ ID NO:5 encoding for a polypeptide comprising amino acid residues 110-244 of apM1. (SEQ ID NO:6), also referred to as gapM1. (110-244). Cells were grown overnight at room temperature (˜22° C.) and induced at an approximate density of 0.45 OD₆₀₀. The cells were induced with 1 mM IPTG for 6 hours at room temperature. Cells were harvested and lysed by French Press and sonication in buffer comprising 50 mM Bis-Tris-Propane, pH 8.75, 1% Triton X-100, 5 mM MgCl₂ and 5 mM CaCl₂ and 50 mM NaCl. Extracts were centrifuged to obtain a soluble fraction. The soluble protein in the supernatant fraction was passed over a Ni-affinity column at 5° C. The column was then washed with 25 mM imidazole. Alternatively, several washes of increasing imidazole up to 25 mM is used. The gapM1. (110-224) protein was eluted from the column with a concentration of 100 mM imidazole. The the eluant comprising the gapM1. (110-244) fraction was run over an ion exchange column using DEAE resin. Alternatively, a hydroxyapatite column is used. The gapM1. (110-244) protein was eluted with high salt buffer and dialyzed to reduce the salt concentration. This material was used in Example 18 to assay affect on FFA oxidation.

Example 18

Effect of gapM1 on Free Fatty Acid Oxidation

Bacterially expressed gapMl-His tag (110-244), gAcrp30-his tag (104-247) and gAcrp30 (104-247)-N-term-His/C-tern-Flag tagged proteins were compared for activity in a free fatty acid (FFA) oxidation assay. Briefly, C2C12 cells and SKMC cells were grown to ˜95% confluency. Cells were differentiated for 7 days. Cells were starved overnight in serum free DME containing 0.5% Albumax I (fatty acid rich BSA). Cells were then preincubated in medium comprising Basal Medium Eagle (BME), 20 mM HEPES, pH7.4, 4 mM L-Glutamine, 1% fatty acid free BSA (0.15 mM). (As a negative control, a flask with no cells was included with 2 ml of Preincubation media alone.) Cells were treated with either 5.0 ug/ml (0.275 mg/ml) gAcrp30 trypsin-cleaved, 5.0 ug/ml (0.36 mg/ml) gAcrp30 bacterially expressed, 5.0 ug/ml (0.24 mg/ml) gAcrp30 His-Flag tagged bacterially expressed, 5.0 ug/ml gapM1(110-244) (0.09 mg/ml) bacterially expressed (described in Example 17) for 2 hours at 37° C. As a positive control, 100 uM isoproteranol was added to the cells 15 minutes before addition of perchloric acid.

Treated cells were assayed as follows:

1. 100 ul of media with 4 mM palmitic acid and 4 uCi/ml 14C-palmitic acid was added to each of the flasks, and cells were incubated at 37° C. for 90 minutes.

2. Using a 1 ml Luer-lok syringe, Solvable was injected into the center well containing a filter paper approximately 15 min before adding the perchloric acid.

3. Using a 1 ml Luer-lok syringe, 1 ml perchloric acid was added and cells were incubated for 1 hour at 37° C.

4. After 1 hour, 15 ml Hionic Fluor was added and filter paper was transferred into appropriate scintillation vial and shaken for 30 min at 4° C., then counted.

The N-terminally His tagged gapM-1 (110-244), gAcrp30 (104-247) and isoproterenol (positive control) increased FFA oxidation to levels greater than that in non-treated cells. The recombinant mouse protein containing both an N-term His tag and a C-term FLAG tag was not active in this assay.

Additional References

1. Shapiro, L. & Scherer, P. E., “The crystal structure of a complement-1q family protein suggests an evolutionary link to tumor necrosis factor”, Curr Biol 8, 335 -338 (1998).

2. Kishore, U. & Reid, K. B., “Modular organization of proteins containing C1q-like globular domain”, Immunopharmacology 42, 15-21 (1999).

3. Kavety, B. & Morgan, J. I., “Characterization of transcript processing of the gene encoding precerebellin-1”, Brain Res Mol Brain Res 63, 98-104 (1998).

4. Satoh, F. et al., “Cerebellin and cerebellin mRNA in the human brain, adrenal glands and the tumour tissues of adrenal tumour, ganglioneuroblastoma and neuroblastoma”, J Endocrinol 154, 27-34 (1997).

5. Mazzocchi, G. et al., “Cerebellin enhances in vitro secretory activity of human adrenal gland”, J Clin Endocrinol Metab 84, 632-635 (1999).

6. Kondo, N. & Kondo, J., “Identification of novel blood proteins specific for mammalian hibernation”, J Biol Chem 267, 473-478 (1992).

7. Takamatsu, N., Ohba, K., Kondo, J., Kondo, N. & Shiba, T., “Hibernation-associated gene regulation of plasma proteins with a collagen-like domain in mammalian hibernators”, Mol Cell Biol 13, 1516-1521 (1993).

8. Spiegelman, B. M. & Hotamisligil, G. S., “Through thick and thin: wasting, obesity, and TNF alpha”, Cell 73, 625-627 (1993).

9. Saito, K. et al., “Regulation of gelatin-binding protein 28 (GBP28) gene expression by C/EBP”, Biol Pharm Bull 22, 1158-1162 (1999).

10. Takamatsu, N. et al., “Expression of multiple alphal-antitrypsin-like genes in hibernating species of the squirrel family”, Gene 204, 127-132 (1997).

11. Ross, S. R., Graves, R. A. & Spiegelman, B. M. ,“Targeted expression of a toxin gene to adipose tissue: transgenic mice resistant to obesity”, Genes Dev 7, 1318 -1324 (1993).

12. Moitra, J. et al., “Life without white fat: a transgenic mouse”, Genes Dev 12, 3168 -3181 (1998).

13. Peters, S. J., Dyck, D. J., Bonen, A. & Spriet, L. L., “Effects of epinephrine on lipid metabolism in resting skeletal muscle”, Am J Physiol 275, E300-309 (1998).

14. Rigoitti, A., Acton, S. L. & Krieger, M., “The class B scavenger receptors SR-BI and CD36 are receptors for anionic phospholipids”,J Biol Chem 270, 16221-16224 (1995).

15. Hirsch, D., Stahl, A. & Lodish, H. F., “A family of fatty acid transporters conserved from mycobacterium to man”, Proc Natl Acad Sci U S A 95, 8625-8629 (1998).

16. Hotamisligil, G. S. et al., “IRS-1-mediated inhibition of insulin receptor tyrosine kinase activity in TNF-alpha- and obesity-induced insulin resistance”, Science 271, 665-668 (1996).

17. Peraldi, P. & Spiegelman, B., “TNF-alpha and insulin resistance: summary and future prospects”, Mol Cell Biochem 182, 169-175 (1998).

18. Kirchgessner, T. G., Uysal, K. T., Wiesbrock, S. M., Marino, M. W. & Hotamisligil, G. S., “Tumor necrosis factor-alpha contributes to obesity-related hyperleptinemia by regulating leptin release from adipocytes”, J Clin Invest 100, 2777-2782 (1997).

19. Uysal, K. T., Wiesbrock, S. M., Marino, M. W. & Hotamisligil, G. S., “Protection from obesity-induced insulin resistance in mice lacking TNF-alpha function”, Nature 389, 610-614 (1997).

20. Jensen, D. R. et al. “Prevention of diet-induced obesity in transgenic mice overexpressing skeletal muscle lipoprotein lipase”, Am J Physiol 273, R683-689 (1997).

21. Levak-Frank, S. et al., “Muscle-specific overexpression of lipoprotein lipase causes a severe myopathy characterized by proliferation of mitochondria and peroxisomes in transgenic mice”, J Clin Invest 96, 976-986 (1995).

22. Scheja, L. et al., “Altered insulin secretion associated with reduced lipolytic efficiency in aP2−/−mice”, Diabetes 48, 1987-1994 (1999).

23. Griffin, M. E. et al., “Free fatty acid-induced insulin resistance is associated with activation of protein kinase C theta and alterations in the insulin signaling cascade”, Diabetes 48, 1270-1274 (1999).

24. Dresner, A. et al., “Effects of free fatty acids on glucose transport and IRS-1-associated phosphatidylinositol 3-kinase activity”, J Clin Invest 103, 253-259 (1999).

25. Kelley, D. E., Goodpaster, B., Wing, R. R. & Simoneau, J. A., “Skeletal muscle fatty acid metabolism in association with insulin resistance, obesity, and weight loss”, Am J Physiol 277, E1130-1141 (1999).

26. Roden, M. et al., “Mechanism of free fatty acid-induced insulin resistance in humans”, J Clin Invest 97, 2859-2865 (1996).

                   #             SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 7 <210> SEQ ID NO 1 <211> LENGTH: 1152 <212> TYPE: DNA <213> ORGANISM: mus musculus <400> SEQUENCE: 1 gaattcggca cgaggg atg cta ctg ttg caa gct ctc  #ctg ttc ctc tta atc     52                    #Met Leu Leu Leu Gln Ala Leu Leu Phe Leu L #eu Ile                    #1               5    #                10 ctg ccc agt cat gcc gaa gat gac gtt act ac #a act gaa gag cta gct      100 Leu Pro Ser His Ala Glu Asp Asp Val Thr Th #r Thr Glu Glu Leu Ala         15           #        20           #        25 cct gct ttg gtc cct cca ccc aag gga act tg #t gca ggt tgg atg gca      148 Pro Ala Leu Val Pro Pro Pro Lys Gly Thr Cy #s Ala Gly Trp Met Ala     30               #    35               #    40 ggc atc cca gga cat tct ggc cac aat ggc ac #a cca ggc cgt gat ggc      196 Gly Ile Pro Gly His Ser Gly His Asn Gly Th #r Pro Gly Arg Asp Gly 45                   #50                   #55                   #60 aga gat ggc act cct gga gag aag gga gag aa #a gga gat tca ggt ctt      244 Arg Asp Gly Thr Pro Gly Glu Lys Gly Glu Ly #s Gly Asp Ser Gly Leu                 65   #                70   #                75 ctt ggt cct aag ggt gag aca gga gat gtt gg #a atg aca gga gct gaa      292 Leu Gly Pro Lys Gly Glu Thr Gly Asp Val Gl #y Met Thr Gly Ala Glu             80       #            85       #            90 ggg cct cgg ggc ttc ccc gga acc cct ggc ag #g aaa gga gag cct gga      340 Gly Pro Arg Gly Phe Pro Gly Thr Pro Gly Ar #g Lys Gly Glu Pro Gly         95           #        100          #        105 gaa gcc gct tat gtg tat cgc tca ggc ttc ag #t gtg ggg ctg gag acc      388 Glu Ala Ala Tyr Val Tyr Arg Ser Gly Phe Se #r Val Gly Leu Glu Thr     110               #   115               #   120 cgc gtc act gtt ccc aat gta ccc att cgc tt #t act aag atc ttc tac      436 Arg Val Thr Val Pro Asn Val Pro Ile Arg Ph #e Thr Lys Ile Phe Tyr 125                 1 #30                 1 #35                 1 #40 aac caa cag aat cat tat gac aac agc act gg #c aag ttc tac tgc aac      484 Asn Gln Gln Asn His Tyr Asp Asn Ser Thr Gl #y Lys Phe Tyr Cys Asn                 145   #               150   #               155 att ccg gga ctc tac tac ttc tct tac cac at #c acg gtg tac atg aaa      532 Ile Pro Gly Leu Tyr Tyr Phe Ser Tyr His Il #e Thr Val Tyr Met Lys             160       #           165       #           170 gat gtg aag gtg agc ctc ttc aag aag gac aa #g gcc gtt ctc ttc acc      580 Asp Val Lys Val Ser Leu Phe Lys Lys Asp Ly #s Ala Val Leu Phe Thr         175           #       180           #       185 tac gac cag tat cag gaa aag aat gtg gac ca #g gcc tct ggc tct gtg      628 Tyr Asp Gln Tyr Gln Glu Lys Asn Val Asp Gl #n Ala Ser Gly Ser Val     190               #   195               #   200 ctc ctc cat ctg gag gtg gga gac caa gtc tg #g ctc cag gtg tat ggg      676 Leu Leu His Leu Glu Val Gly Asp Gln Val Tr #p Leu Gln Val Tyr Gly 205                 2 #10                 2 #15                 2 #20 gat ggg gac cac aat gga ctc tat gca gat aa #c gtc aac gac tct aca      724 Asp Gly Asp His Asn Gly Leu Tyr Ala Asp As #n Val Asn Asp Ser Thr                 225   #               230   #               235 ttt act ggc ttt ctt ctc ttc cat gat acc aa #c tga ctgcaactac           770 Phe Thr Gly Phe Leu Leu Phe His Asp Thr As #n  *             240       #           245 tcatagccca tacaccagga gaatcatgga acgtcgacac actttcagct ta #gtttgaga    830 gattgatttt attgcttagt ttgagagtcc tgagtattat ccacacgtgt ac #tcacttgt    890 tcattaaacg actttataaa aaataatttg tgttcctagt ccagaaaaaa ag #gcactccc    950 tggtctccac gactcttaca tggtagcaat aacagaatga aaatcacatt tg #gtatgggg   1010 gcttcacaat attcgcatga ctgtctggaa gtagaccatg ctatttttct gc #tcactgta   1070 cacaaatatt gttcacataa accctataat gtaaatatga aatacagtga tt #atcttctc   1130 aaaaaaaact cgtgccgaat tc            #                   #               1152 <210> SEQ ID NO 2 <211> LENGTH: 247 <212> TYPE: PRT <213> ORGANISM: mus musculus <400> SEQUENCE: 2 Met Leu Leu Leu Gln Ala Leu Leu Phe Leu Le #u Ile Leu Pro Ser His 1               5    #                10   #                15 Ala Glu Asp Asp Val Thr Thr Thr Glu Glu Le #u Ala Pro Ala Leu Val             20       #            25       #            30 Pro Pro Pro Lys Gly Thr Cys Ala Gly Trp Me #t Ala Gly Ile Pro Gly         35           #        40           #        45 His Ser Gly His Asn Gly Thr Pro Gly Arg As #p Gly Arg Asp Gly Thr     50               #    55               #    60 Pro Gly Glu Lys Gly Glu Lys Gly Asp Ser Gl #y Leu Leu Gly Pro Lys 65                   #70                   #75                   #80 Gly Glu Thr Gly Asp Val Gly Met Thr Gly Al #a Glu Gly Pro Arg Gly                 85   #                90   #                95 Phe Pro Gly Thr Pro Gly Arg Lys Gly Glu Pr #o Gly Glu Ala Ala Tyr             100       #           105       #           110 Val Tyr Arg Ser Gly Phe Ser Val Gly Leu Gl #u Thr Arg Val Thr Val         115           #       120           #       125 Pro Asn Val Pro Ile Arg Phe Thr Lys Ile Ph #e Tyr Asn Gln Gln Asn     130               #   135               #   140 His Tyr Asp Asn Ser Thr Gly Lys Phe Tyr Cy #s Asn Ile Pro Gly Leu 145                 1 #50                 1 #55                 1 #60 Tyr Tyr Phe Ser Tyr His Ile Thr Val Tyr Me #t Lys Asp Val Lys Val                 165   #               170   #               175 Ser Leu Phe Lys Lys Asp Lys Ala Val Leu Ph #e Thr Tyr Asp Gln Tyr             180       #           185       #           190 Gln Glu Lys Asn Val Asp Gln Ala Ser Gly Se #r Val Leu Leu His Leu         195           #       200           #       205 Glu Val Gly Asp Gln Val Trp Leu Gln Val Ty #r Gly Asp Gly Asp His     210               #   215               #   220 Asn Gly Leu Tyr Ala Asp Asn Val Asn Asp Se #r Thr Phe Thr Gly Phe 225                 2 #30                 2 #35                 2 #40 Leu Leu Phe His Asp Thr Asn                 245 <210> SEQ ID NO 3 <211> LENGTH: 1276 <212> TYPE: DNA <213> ORGANISM: mus musculus <400> SEQUENCE: 3 ctctaaagat tgtcagtgga tctgacgaca ccaaaagggc tcagg atg  #cta ctg ttg     57                    #                   #              Met Leu L #eu Leu                    #                   #              1 caa gct ctc ctg ttc ctc tta atc ctg ccc ag #t cat gcc gaa gat gac      105 Gln Ala Leu Leu Phe Leu Leu Ile Leu Pro Se #r His Ala Glu Asp Asp 5                   # 10                  # 15                  # 20 gtt act aca act gaa gag cta gct cct gct tt #g gtc cct cca ccc aag      153 Val Thr Thr Thr Glu Glu Leu Ala Pro Ala Le #u Val Pro Pro Pro Lys                 25   #                30   #                35 gga act tgt gca ggt tgg atg gca ggc atc cc #a gga cat cct ggc cac      201 Gly Thr Cys Ala Gly Trp Met Ala Gly Ile Pr #o Gly His Pro Gly His             40       #            45       #            50 aat ggc aca cca ggc cgt gat ggc aga gat gg #c act cct gga gag aag      249 Asn Gly Thr Pro Gly Arg Asp Gly Arg Asp Gl #y Thr Pro Gly Glu Lys         55           #        60           #        65 gga gag aaa gga gat gca ggt ctt ctt ggt cc #t aag ggt gag aca gga      297 Gly Glu Lys Gly Asp Ala Gly Leu Leu Gly Pr #o Lys Gly Glu Thr Gly     70               #    75               #    80 gat gtt gga atg aca gga gct gaa ggg cca cg #g ggc ttc ccc gga acc      345 Asp Val Gly Met Thr Gly Ala Glu Gly Pro Ar #g Gly Phe Pro Gly Thr 85                   #90                   #95                   #100 cct ggc agg aaa gga gag cct gga gaa gcc gc #t tat atg tat cgc tca      393 Pro Gly Arg Lys Gly Glu Pro Gly Glu Ala Al #a Tyr Met Tyr Arg Ser                 105   #               110   #               115 gcg ttc agt gtg ggg ctg gag acc cgc gtc ac #t gtt ccc aat gta ccc      441 Ala Phe Ser Val Gly Leu Glu Thr Arg Val Th #r Val Pro Asn Val Pro             120       #           125       #           130 att cgc ttt act aag atc ttc tac aac caa ca #g aat cat tat gac ggc      489 Ile Arg Phe Thr Lys Ile Phe Tyr Asn Gln Gl #n Asn His Tyr Asp Gly         135           #       140           #       145 agc act ggc aag ttc tac tgc aac att ccg gg #a ctc tac tac ttc tct      537 Ser Thr Gly Lys Phe Tyr Cys Asn Ile Pro Gl #y Leu Tyr Tyr Phe Ser     150               #   155               #   160 tac cac atc acg gtg tac atg aaa gat gtg aa #g gtg agc ctc ttc aag      585 Tyr His Ile Thr Val Tyr Met Lys Asp Val Ly #s Val Ser Leu Phe Lys 165                 1 #70                 1 #75                 1 #80 aag gac aag gcc gtt ctc ttc acc tac gac ca #g tat cag gaa aag aat      633 Lys Asp Lys Ala Val Leu Phe Thr Tyr Asp Gl #n Tyr Gln Glu Lys Asn                 185   #               190   #               195 gtg gac cag gcc tct ggc tct gtg ctc ctc ca #t ctg gag gtg gga gac      681 Val Asp Gln Ala Ser Gly Ser Val Leu Leu Hi #s Leu Glu Val Gly Asp             200       #           205       #           210 caa gtc tgg ctc cag gtg tat ggg gat ggg ga #c cac aat gga ctc tat      729 Gln Val Trp Leu Gln Val Tyr Gly Asp Gly As #p His Asn Gly Leu Tyr         215           #       220           #       225 gca gat aac gtc aac gac tct aca ttt act gg #c ttt ctt ctc tac cat      777 Ala Asp Asn Val Asn Asp Ser Thr Phe Thr Gl #y Phe Leu Leu Tyr His     230               #   235               #   240 gat acc aac tga ctgcaactac ccatagccca tacaccagga ga #atcatgga          829 Asp Thr Asn  * 245 acagtcgaca cactttcagc ttagtttgag agattgattt tattgcttag tt #tgagagtc    889 ctgagtatta tccacacgtg tactcacttg ttcattaaac gactttataa aa #aataattt    949 gtgttcctag tccagaaaaa aaggcactcc ctggtctcca cgactcttac at #ggtagcaa   1009 taacagaatg aaaatcacat ttggtatggg ggcttcacaa tattcgcatg ac #tgtctgga   1069 agtagaccat gctatttttc tgctcactgt acacaaatat tgttcacata aa #ccctataa   1129 tgtaaatatg aaatacagtg attactcttc tcacaggctg agtgtatgaa tg #tctaaaga   1189 cccataagta ttaaagtggt agggataaat tggaaaaaaa aaaaaaaaaa aa #gaaaaact   1249 ttagagcaca ctggcggccg ttactag           #                   #           1276 <210> SEQ ID NO 4 <211> LENGTH: 247 <212> TYPE: PRT <213> ORGANISM: mus musculus <400> SEQUENCE: 4 Met Leu Leu Leu Gln Ala Leu Leu Phe Leu Le #u Ile Leu Pro Ser His 1               5    #                10   #                15 Ala Glu Asp Asp Val Thr Thr Thr Glu Glu Le #u Ala Pro Ala Leu Val             20       #            25       #            30 Pro Pro Pro Lys Gly Thr Cys Ala Gly Trp Me #t Ala Gly Ile Pro Gly         35           #        40           #        45 His Pro Gly His Asn Gly Thr Pro Gly Arg As #p Gly Arg Asp Gly Thr     50               #    55               #    60 Pro Gly Glu Lys Gly Glu Lys Gly Asp Ala Gl #y Leu Leu Gly Pro Lys 65                   #70                   #75                   #80 Gly Glu Thr Gly Asp Val Gly Met Thr Gly Al #a Glu Gly Pro Arg Gly                 85   #                90   #                95 Phe Pro Gly Thr Pro Gly Arg Lys Gly Glu Pr #o Gly Glu Ala Ala Tyr             100       #           105       #           110 Met Tyr Arg Ser Ala Phe Ser Val Gly Leu Gl #u Thr Arg Val Thr Val         115           #       120           #       125 Pro Asn Val Pro Ile Arg Phe Thr Lys Ile Ph #e Tyr Asn Gln Gln Asn     130               #   135               #   140 His Tyr Asp Gly Ser Thr Gly Lys Phe Tyr Cy #s Asn Ile Pro Gly Leu 145                 1 #50                 1 #55                 1 #60 Tyr Tyr Phe Ser Tyr His Ile Thr Val Tyr Me #t Lys Asp Val Lys Val                 165   #               170   #               175 Ser Leu Phe Lys Lys Asp Lys Ala Val Leu Ph #e Thr Tyr Asp Gln Tyr             180       #           185       #           190 Gln Glu Lys Asn Val Asp Gln Ala Ser Gly Se #r Val Leu Leu His Leu         195           #       200           #       205 Glu Val Gly Asp Gln Val Trp Leu Gln Val Ty #r Gly Asp Gly Asp His     210               #   215               #   220 Asn Gly Leu Tyr Ala Asp Asn Val Asn Asp Se #r Thr Phe Thr Gly Phe 225                 2 #30                 2 #35                 2 #40 Leu Leu Tyr His Asp Thr Asn                 245 <210> SEQ ID NO 5 <211> LENGTH: 4517 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 5 ctgattccat accagagggg ctcagg atg ctg ttg ctg gga  #gct gtt cta ctg      53                    #           Met Leu Leu Leu  #Gly Ala Val Leu Leu                    #           1        #        5 cta tta gct ctg ccc ggg cat gac cag gaa ac #c acg act caa ggg ccc      101 Leu Leu Ala Leu Pro Gly His Asp Gln Glu Th #r Thr Thr Gln Gly Pro 10                   #15                   #20                   #25 gga gtc ctg ctt ccc ctg ccc aag ggg gcc tg #c aca ggt tgg atg gcg      149 Gly Val Leu Leu Pro Leu Pro Lys Gly Ala Cy #s Thr Gly Trp Met Ala                 30   #                35   #                40 ggc atc cca ggg cat ccg ggc cat aat ggg gc #c cca ggc cgt gat ggc      197 Gly Ile Pro Gly His Pro Gly His Asn Gly Al #a Pro Gly Arg Asp Gly             45       #            50       #            55 aga gat ggc acc cct ggt gag aag ggt gag aa #a gga gat cca ggt ctt      245 Arg Asp Gly Thr Pro Gly Glu Lys Gly Glu Ly #s Gly Asp Pro Gly Leu         60           #        65           #        70 att ggt cct aag gga gac atc ggt gaa acc gg #a gta ccc ggg gct gaa      293 Ile Gly Pro Lys Gly Asp Ile Gly Glu Thr Gl #y Val Pro Gly Ala Glu     75               #    80               #    85 ggt ccc cga ggc ttt ccg gga atc caa ggc ag #g aaa gga gaa cct gga      341 Gly Pro Arg Gly Phe Pro Gly Ile Gln Gly Ar #g Lys Gly Glu Pro Gly 90                   #95                   #100                  #105 gaa ggt gcc tat gta tac cgc tca gca ttc ag #t gtg gga ttg gag act      389 Glu Gly Ala Tyr Val Tyr Arg Ser Ala Phe Se #r Val Gly Leu Glu Thr                 110   #               115   #               120 tac gtt act atc ccc aac atg ccc att cgc tt #t acc aag atc ttc tac      437 Tyr Val Thr Ile Pro Asn Met Pro Ile Arg Ph #e Thr Lys Ile Phe Tyr             125       #           130       #           135 aat cag caa aac cac tat gat ggc tcc act gg #t aaa ttc cac tgc aac      485 Asn Gln Gln Asn His Tyr Asp Gly Ser Thr Gl #y Lys Phe His Cys Asn         140           #       145           #       150 att cct ggg ctg tac tac ttt gcc tac cac at #c aca gtc tat atg aag      533 Ile Pro Gly Leu Tyr Tyr Phe Ala Tyr His Il #e Thr Val Tyr Met Lys     155               #   160               #   165 gat gtg aag gtc agc ctc ttc aag aag gac aa #g gct atg ctc ttc acc      581 Asp Val Lys Val Ser Leu Phe Lys Lys Asp Ly #s Ala Met Leu Phe Thr 170                 1 #75                 1 #80                 1 #85 tat gat cag tac cag gaa aat aat gtg gac ca #g gcc tcc ggc tct gtg      629 Tyr Asp Gln Tyr Gln Glu Asn Asn Val Asp Gl #n Ala Ser Gly Ser Val                 190   #               195   #               200 ctc ctg cat ctg gag gtg ggc gac caa gtc tg #g ctc cag gtg tat ggg      677 Leu Leu His Leu Glu Val Gly Asp Gln Val Tr #p Leu Gln Val Tyr Gly             205       #           210       #           215 gaa gga gag cgt aat gga ctc tat gct gat aa #t gac aat gac tcc acc      725 Glu Gly Glu Arg Asn Gly Leu Tyr Ala Asp As #n Asp Asn Asp Ser Thr         220           #       225           #       230 ttc aca ggc ttt ctt ctc tac cat gac acc aa #c tga tcaccactaa           771 Phe Thr Gly Phe Leu Leu Tyr His Asp Thr As #n  *     235               #   240               #   245 ctcagagcct cctccaggcc aaacagcccc aaagtcaatt aaaggctttc ag #tacggtta    831 ggaagttgat tattatttag ttggaggcct ttagatatta ttcattcatt ta #ctcattca    891 tttattcatt cattcatcaa gtaactttaa aaaaatcata tgctatgttc cc #agtcctgg    951 ggagcttcac aaacatgacc agataactga ctagaaagaa gtagttgaca gt #gctatttt   1011 gtgcccactg tctctcctga tgctcatatc aatcctataa ggcacaggga ac #aagcattc   1071 tcctgttttt acagattgta tcctgaggct gagagagtta agtgaatgtc ta #aggtcaca   1131 cagtattaag tgacagtgct agaaatcaaa cccagagctg tggactttgt tc #actagact   1191 gtgccctttt atagaggtac atgttctctt tggagtgttg gtaggtgtct gt #ttcccacc   1251 tcacctgaga gccattgaat ttgccttcct catgaattaa aacctccccc aa #gcagagct   1311 tcctcagaga aagtggttct atgatgaagt cctgtcttgg aaggactact ac #tcaatggc   1371 ccctgcacta ctctacttcc tcttacctat gtcccttctc atgcctttcc ct #ccaacggg   1431 gaaagccaac tccatctcta agtgctgaac tcatccctgt tcctcaaggc ca #cctggcca   1491 ggagcttctc tgatgtgata tccacttttt tttttttttg agatggagtc tc #actctgtc   1551 acccaggctg gagtacagtg acacgacctc ggctcactgc agcctccttc tc #ctgggtcc   1611 aagcaattat tgtgcctcag cctcccgagt agctgagact tcaggtgcat tc #caccacac   1671 atggctaatt tttgtatttt tagtagaaat ggggtttcgt catgttggcc ag #gctggtct   1731 cgaactcctg gcctaggtga tccacccgcc tcgacctccc aaagtgctgg ga #ttacaggc   1791 atgagccacc atgcccagtc gatatctcac tttttatttt gccatggatg ag #agtcctgg   1851 gtgtgaggaa cacctcccac caggctagag gcaactgccc aggaaggact gt #gcttccgt   1911 cacctctaaa tcccttgcag atccttgata aatgcctcat gaagaccaat ct #cttgaatc   1971 ccatatctac ccagaattaa ctccattcca gtctctgcat gtaatcagtt tt #atccacag   2031 aaacattttc attttaggaa atccctggtt taagtatcaa tccttgttca gc #tggacaat   2091 atgaatcttt tccactgaag ttagggatga ctgtgatttt cagaacacgt cc #agaatttt   2151 tcatcaagaa ggtagcttga gcctgaaatg caaaacccat ggaggaattc tg #aagccatt   2211 gtctccttga gtaccaacag ggtcagggaa gactgggcct cctgaattta tt #attgttct   2271 ttaagaatta caggttgagg tagttgatgg tggtaaacat tctctcagga ga #caataact   2331 ccagtgatgt ttttcaaaga ttttagcaaa aacagagtaa atagcattct ct #atcaatat   2391 ataaatttaa aaaactatct ttttgcttac agttttaaat tctgaacaat tt #ctcttata   2451 tgtgtattgc taatcattaa ggtattattt tttccacata taaagctttg tc #tttttgtt   2511 gttgttgttg tttttaagat ggagtttccc tctgttgcca ggctagagtg ca #gtggcatg   2571 atctcggctt actgcaacct ttgcctccca ggtttaagcg attcttctgc ct #cagcctcc   2631 cgagtagctg ggaccacagg tgcctaccac catgccaggc taatttttgt at #ttttagta   2691 aagacagggt ttcaccatat tggccaggct ggtctcgaac tcctgacctt gt #gatctgcc   2751 cgcctccatt gtgttgttat ttgtgagaaa gatagatatg aggtttagag ag #ggatgaag   2811 aggtgagagt aagccttgtg ttagtcagaa ctctgtgttg tgaatgtcat tc #acaacaga   2871 aaacccaaaa tattatgcaa actactgtaa gcaagaaaaa taaaggaaaa at #ggaaacat   2931 ttattccttt gcataataga aattaccaga gttgttctgt ctttagataa gg #tttgaacc   2991 aaagctcaaa acaatcaaga cccttttctg tatgtccttc tgttctgcct tc #cgcagtgt   3051 aggctttacc ctcaggtgct acacagtata gttctagggt ttccctcccg at #atcaaaaa   3111 gactgtggcc tgcccagctc tcgtatcccc aagccacacc atctggctaa at #ggacatca   3171 tgttttctgg tgatgcccaa agaggagaga ggaagctctc tttcccagat gc #cccagcaa   3231 gtgtaacctt gcatctcatt gctctggctg agttgtgtgc ctgtttctga cc #aatcactg   3291 agtcaggagg atgaaatatt catattgact taattgcagc ttaagttagg gg #tatgtaga   3351 ggtattttcc ctaaagcaaa attgggacac tgttatcaga aataggagag tg #gatgatag   3411 atgcaaaata atacctgtcc acaacaaact cttaatgctg tgtttgagct tt #catgagtt   3471 tcccagagag acatagctgg aaaattccta ttgattttct ctaaaatttc aa #caagtagc   3531 taaagtctgg ctatgctcac agtctcacat ctggtggggg tgggctcctt ac #agaacacg   3591 ctttcacagt taccctaaac tctctggggc agggttattc ctttgtggaa cc #agaggcac   3651 agagacagtc aactgaggcc caacagaggc ctgagagaaa ctgaggtcaa ga #tttcagga   3711 ttaatggtcc tgtgatgctt tgaagtacaa ttgtggattt gtccaattct ct #ttagttct   3771 gtcagctttt gcttcatata ttttagcgct ctattattag atatatacat gt #ttagtatt   3831 atgtcttatt ggtgcattta ctctcttatc attatgtaat gtccttcttt at #ctgtgata   3891 attttctgtg ttctgaagtc tactttgtct aaaaataaca tacgcactca ac #ttcctttt   3951 ctttcttcct tcctttcttt cttccttcct ttctttctct ctctctcttt cc #ttccttcc   4011 ttcctccttt tctctctctc tctctctctc tctcttttct tgacagactc tc #gttctgtg   4071 gccctggctg gagttcagtg gtgtgatctt ggctcactgc tacctctacc at #gagcaatt   4131 ctcctgcctc agcctcccaa gtagctggaa ctacaggctc atgccactgc gc #ccagctaa   4191 tttttgtatt tttcgtagag acggggtttc accacattcg tcaggttggt tt #caaactcc   4251 tgactttgtg atccacccgc ctcggcctcc caaagtgctg ggattacagg ca #tgagccat   4311 cacacctggt caactttctt ttgattagtg tttttgtggt atatcttttt cc #atcatgtt   4371 actttaaata tatctatatt attgtattta aaatgtgttt cttacagact gc #atgtagtt   4431 gggtataatt tttatccagt ctaaaaatat ctgtctttta attggtgttt ag #acaattta   4491 tatttaataa aatggtggaa tttaaa           #                   #            4517 <210> SEQ ID NO 6 <211> LENGTH: 244 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 6 Met Leu Leu Leu Gly Ala Val Leu Leu Leu Le #u Ala Leu Pro Gly His 1               5    #                10   #                15 Asp Gln Glu Thr Thr Thr Gln Gly Pro Gly Va #l Leu Leu Pro Leu Pro             20       #            25       #            30 Lys Gly Ala Cys Thr Gly Trp Met Ala Gly Il #e Pro Gly His Pro Gly         35           #        40           #        45 His Asn Gly Ala Pro Gly Arg Asp Gly Arg As #p Gly Thr Pro Gly Glu     50               #    55               #    60 Lys Gly Glu Lys Gly Asp Pro Gly Leu Ile Gl #y Pro Lys Gly Asp Ile 65                   #70                   #75                   #80 Gly Glu Thr Gly Val Pro Gly Ala Glu Gly Pr #o Arg Gly Phe Pro Gly                 85   #                90   #                95 Ile Gln Gly Arg Lys Gly Glu Pro Gly Glu Gl #y Ala Tyr Val Tyr Arg             100       #           105       #           110 Ser Ala Phe Ser Val Gly Leu Glu Thr Tyr Va #l Thr Ile Pro Asn Met         115           #       120           #       125 Pro Ile Arg Phe Thr Lys Ile Phe Tyr Asn Gl #n Gln Asn His Tyr Asp     130               #   135               #   140 Gly Ser Thr Gly Lys Phe His Cys Asn Ile Pr #o Gly Leu Tyr Tyr Phe 145                 1 #50                 1 #55                 1 #60 Ala Tyr His Ile Thr Val Tyr Met Lys Asp Va #l Lys Val Ser Leu Phe                 165   #               170   #               175 Lys Lys Asp Lys Ala Met Leu Phe Thr Tyr As #p Gln Tyr Gln Glu Asn             180       #           185       #           190 Asn Val Asp Gln Ala Ser Gly Ser Val Leu Le #u His Leu Glu Val Gly         195           #       200           #       205 Asp Gln Val Trp Leu Gln Val Tyr Gly Glu Gl #y Glu Arg Asn Gly Leu     210               #   215               #   220 Tyr Ala Asp Asn Asp Asn Asp Ser Thr Phe Th #r Gly Phe Leu Leu Tyr 225                 2 #30                 2 #35                 2 #40 His Asp Thr Asn <210> SEQ ID NO 7 <211> LENGTH: 20966 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: 1..4811 <223> OTHER INFORMATION: 5′ regulatory region <221> NAME/KEY: exon <222> LOCATION: 4812..4851 <223> OTHER INFORMATION: exon 1 <221> NAME/KEY: exon <222> LOCATION: 15144..15365 <223> OTHER INFORMATION: exon 2 <221> NAME/KEY: exon <222> LOCATION: 16277..20559 <223> OTHER INFORMATION: exon 3 <221> NAME/KEY: misc_feature <222> LOCATION: 20560..20966 <223> OTHER INFORMATION: 3′ regulatory region <221> NAME/KEY: allele <222> LOCATION: 3787 <223> OTHER INFORMATION: 9-27-261 : polymorphic ba #se G or C <221> NAME/KEY: allele <222> LOCATION: 11118 <223> OTHER INFORMATION: 99-14387-129 : polymorphic  #base A or C <221> NAME/KEY: allele <222> LOCATION: 15120 <223> OTHER INFORMATION: 9-12-48 : polymorphic bas #e C or T <221> NAME/KEY: allele <222> LOCATION: 15196 <223> OTHER INFORMATION: 9-12-124 : polymorphic ba #se G or T <221> NAME/KEY: allele <222> LOCATION: 15427 <223> OTHER INFORMATION: 9-12-355 : polymorphic ba #se G or T <221> NAME/KEY: allele <222> LOCATION: 15500 <223> OTHER INFORMATION: 9-12-428 : polymorphic ba #se A or G <221> NAME/KEY: allele <222> LOCATION: 15863 <223> OTHER INFORMATION: 99-14405-105 : polymorphic  #base A or G <221> NAME/KEY: allele <222> LOCATION: 17170 <223> OTHER INFORMATION: 9-16-189 : polymorphic ba #se deletion of A <221> NAME/KEY: primer_bind <222> LOCATION: 3528..3545 <223> OTHER INFORMATION: 9-27.pu <221> NAME/KEY: primer_bind <222> LOCATION: 3928..3946 <223> OTHER INFORMATION: 9-27.rp complement <221> NAME/KEY: primer_bind <222> LOCATION: 10990..11008 <223> OTHER INFORMATION: 99-14387.pu <221> NAME/KEY: primer_bind <222> LOCATION: 11423..11442 <223> OTHER INFORMATION: 99-14387.rp complement <221> NAME/KEY: primer_bind <222> LOCATION: 15073..15092 <223> OTHER INFORMATION: 9-12.pu <221> NAME/KEY: primer_bind <222> LOCATION: 15503..15520 <223> OTHER INFORMATION: 9-12.rp complement <221> NAME/KEY: primer_bind <222> LOCATION: 15759..15776 <223> OTHER INFORMATION: 99-14405.pu <221> NAME/KEY: primer_bind <222> LOCATION: 16191..16211 <223> OTHER INFORMATION: 99-14405.rp complement <221> NAME/KEY: primer_bind <222> LOCATION: 16982..17001 <223> OTHER INFORMATION: 9-16.pu <221> NAME/KEY: primer_bind <222> LOCATION: 17384..17402 <223> OTHER INFORMATION: 9-16.rp complement <221> NAME/KEY: misc_binding <222> LOCATION: 3775..3799 <223> OTHER INFORMATION: 9-27-261.probe <221> NAME/KEY: misc_binding <222> LOCATION: 11106..11130 <223> OTHER INFORMATION: 99-14387-129.probe <221> NAME/KEY: misc_binding <222> LOCATION: 15108..15132 <223> OTHER INFORMATION: 9-12-48.probe <221> NAME/KEY: misc_binding <222> LOCATION: 15184..15208 <223> OTHER INFORMATION: 9-12-124.probe <221> NAME/KEY: misc_binding <222> LOCATION: 15415..15439 <223> OTHER INFORMATION: 9-12-355.probe <221> NAME/KEY: misc_binding <222> LOCATION: 15488..15512 <223> OTHER INFORMATION: 9-12-428.probe <221> NAME/KEY: misc_binding <222> LOCATION: 15851..15875 <223> OTHER INFORMATION: 99-14405-105.probe <221> NAME/KEY: misc_binding <222> LOCATION: 17158..17182 <223> OTHER INFORMATION: 9-16-189.probe <221> NAME/KEY: primer_bind <222> LOCATION: 3768..3786 <223> OTHER INFORMATION: 9-27-261.mis <221> NAME/KEY: primer_bind <222> LOCATION: 3788..3806 <223> OTHER INFORMATION: 9-27-261.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 11099..11117 <223> OTHER INFORMATION: 99-14387-129.mis <221> NAME/KEY: primer_bind <222> LOCATION: 11119..11137 <223> OTHER INFORMATION: 99-14387-129.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 15101..15119 <223> OTHER INFORMATION: 9-12-48.mis <221> NAME/KEY: primer_bind <222> LOCATION: 15121..15139 <223> OTHER INFORMATION: 9-12-48.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 15177..15195 <223> OTHER INFORMATION: 9-12-124.mis <221> NAME/KEY: primer_bind <222> LOCATION: 15197..15215 <223> OTHER INFORMATION: 9-12-124.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 15408..15426 <223> OTHER INFORMATION: 9-12-355.mis <221> NAME/KEY: primer_bind <222> LOCATION: 15428..15446 <223> OTHER INFORMATION: 9-12-355.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 15481..15499 <223> OTHER INFORMATION: 9-12-428.mis <221> NAME/KEY: primer_bind <222> LOCATION: 15501..15519 <223> OTHER INFORMATION: 9-12-428.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 15844..15862 <223> OTHER INFORMATION: 99-14405-105.mis <221> NAME/KEY: primer_bind <222> LOCATION: 15864..15882 <223> OTHER INFORMATION: 99-14405-105.mis complement <221> NAME/KEY: primer_bind <222> LOCATION: 17151..17169 <223> OTHER INFORMATION: 9-16-189.mis <221> NAME/KEY: primer_bind <222> LOCATION: 17171..17189 <223> OTHER INFORMATION: 9-16-189.mis complement <400> SEQUENCE: 7 gctgatctgc tgcctcagcc ttcccaaagt gctgtaattt attaggcata ag #ccactgtg     60 cctgcctagt gttgtacatt ctgtgggttt tgacaattgt atgcatctac at #gtatgtac    120 catttatagt attcctgttt ttaattttag ccattctagt aggcatgtag tg #atatctca    180 tggtgatttt aatttgcgtt tccgtaatgg ttaataatgc tgaacatctt tg #catgtgct    240 tgtttgtcat ttgtgtttcc tacttggtga aataattgtt catgtccttt gt #ccattttc    300 taattgaatt tttttttacc atttagtttt gagatttctt tatacaatct ag #atccaaat    360 ctcttgtctc aaatatggtt tgcaaataca ttcctctaat tcatatattg cc #ttttcctc    420 ctcttaacag gatgtttcac agagcaaaag ttttagtttt gttgaaatct ca #cttttcat    480 ttttttcttt agtggattgt gcttttgttg tcatatgtaa gaactcttca ct #ggccctag    540 atccttgtat tggtttccta agattgccat agcaaatcac catgaactta gt #gacaaaaa    600 gacagaaatt tattttcact tcctactgtg ggcagactag acgttaatta tt #ttcatgta    660 tgctcattcc tatgacatct ttctgatata ataattatag ttattcttaa gc #ttcaccct    720 tttttctatt agctttgtta ccttgggtgt cactttttct tttttgacat tg #tgacctat    780 gccagatcat gtctgttagt acttagccct ccattcacct ctccataatc cc #ttttgtat    840 tcctggagct tgatgcctga aatgacacat cctacattcc tttgccagat gg #gtaccagt    900 tagcttgtgc acatgggaga caaccgtgaa aagactgaag tggggaagaa gg #gaggagct    960 gttgtgtttc agtgagcgcc cttggcagtg gcggtgacag tggctcctgt tc #agtggcaa   1020 tggtggagca gctagcaaga catgcagtaa gcgcaggctc ataggctatg gt #ccaggagc   1080 agtcaccgat tcctggtctt taggcaatat catctccctt tgcttctcca gc #ctttctaa   1140 aattattgta ccttgactag tacaattttt tagtattggg ggtagtccaa gg #acacaggc   1200 tttaaaaagt atgaattcag ggttgcctac ctgcattgac tgcgcttgaa tc #atgatggc   1260 cttctggtcg gtggcaggag gtgacagtcc aaatcatgca gtagcaaacc ag #atacttaa   1320 attatcatct gagatacttc agaagtacag ccgtagccat accttcagaa ga #gataaaga   1380 aatgttctcc tggccaggcg cggtggctca cgcctgtcat tccagcactt tg #ggaggccg   1440 agggggtgga tcacctgagg tcgggagttc gagaccagcc tgaccaacat gg #ggaaaccc   1500 tgtctctact aaaaatacaa aattagcggg gcgtggtggc acatgcccat aa #tcccagct   1560 actcgggagg ctaaggcagg ataatcgctt gaacctgaga ggcagaggtt gc #ggtgaact   1620 gagatcatgc catagtactc cagcctgggc aacaagagtg aaactccatc tc #aaaaaaaa   1680 aaaaaagaaa aaaagataaa gaaatgttct cctttcttgc catttctagg gg #tttgggga   1740 tggcgtacat tgctgcaggg cgtgctcact ctaccatctt gctccaatct tt #atttttca   1800 aaatacagtg cttatgcttg gttacttcag ttaagattat ttttaaaaat ca #taattaag   1860 caaaaatata tggccatgct taaacatatt taagataaat taagtgattt gg #cctgtttc   1920 agtatcccaa ctcacatgct aacaggggct tgacctgtag ctacggtacc ct #ggaggaaa   1980 tgatcgcatt tatttggtta tttcggtcta agtagtaata gttctgtcct gg #gaaaaaga   2040 ctagcctcaa ggcatttctg attgaatgtt tttcaattac agtctttaaa cc #agtatgcc   2100 acagaactgg ctctttccac atgacggcct ttgtggtggg tggcagattg cc #ctgaggcc   2160 tcgcaaaatg ctaggctttc acaatgtcac tgactgacag ccaggcccag ca #cagtcttg   2220 gtgtgattgt ggggctaaag ttattccacc ttgtgcaata gctacagcct tc #tctaacca   2280 gctgcattct tataaagtta gaagaaaata cttttttttt tttgagatgg at #tctcgctc   2340 tgttgcccag gctggagtgc aatggtgcga tctcggctcg ctgcaacctc cg #cctcctgg   2400 gttcaaacga ttctcctccc tcagaccccc gagtagctgg gattgcaggt gc #ctgccacc   2460 acgcccggct aacttttttg tatttttagt ggagacgggg tttcaccatc tt #cgtcaggc   2520 tggtctcaga ctcctgacct caagtgatct gcccgcctca gcctcccaaa at #gctgggat   2580 tacaggcatg agctactgtg cccggccaaa gaaaatactt tttatgccag cc #ctgaaact   2640 accctgaagc acatacatca accttgaggc ctcacactcc atcaagaggg gt #gaagggca   2700 tgaggaatta gaaagcatag ggatttttag ttagacagat ctggttcaaa tc #ctagactt   2760 gtgccttgaa caaattattt accctcattg aactctagat tcattatttg ta #aaatgaaa   2820 gacaataata gttatctcca aaggaaagtt gaatatgatc attcatttat tc #attaattc   2880 aacatttatt attgcctact ttgtgccagg ttctattcta ggaactaagg ga #tacaactt   2940 tgaataggca aaatctctgc tctcctgaag tttacttttt tttttttttt tt #gagacaga   3000 gtttcactct tgtcacccag gctggagcgc aatggtgctc ttggctcact gc #aacctcca   3060 cctcctgggt tcaagtgatt ctcttgtctc agcctcccaa gtagctggga ct #acaggtat   3120 gtgccaccac gcccggctat ttctgcattt ttagtagaga tggggtttca cc #atgttggc   3180 cagactggtc tcaaactcct gatctcaggt gatatgcctg tcttggcctt cc #aaagtact   3240 gggattacag gcctgagcca ctgcacctga cctgaagttt atgttctatt aa #atagcaac   3300 agacagtaac ataaaccaaa aataaatagg aaaacaccat aacaaaaatc aa #acagtgat   3360 ataattgaga gttgcttcta tttctttttg ttgtcttctt ggttcaatca gc #ctgctaaa   3420 ctatatggaa cctcattttc atgggccact tatttaagcc gggggacctt gg #aaagtctc   3480 tcatgtctct catctcaacg gcctaatgtg acttctcttg aaatatttgg ac #attagcag   3540 gaagctgagg ctttacatca gatctttact ttaatggtgg acttgacttt ac #tggtagat   3600 ttttaggctc tgtgtggact gtggagatga tatctggggg gcaggcagac ac #ttgccctg   3660 cctctgtctg agaaaattct gttttggatg tcttgttgaa gttggtgctg gc #atcctaag   3720 cccttgctgg ggtcgtaatt taattcatca gaatgtgtgg cttgcaagaa cc #ggctcaga   3780 tcctgcsctt caaaaacaaa acatgagcgt gccaagaaag tccaaggtgt tg #aatgttgc   3840 cacttcaagc ctaaactttc taggaacacc taagtgggtg gcagcttcca gt #tctccagg   3900 ctgcttctag gccagagctg ggttccacaa gagacagaat aggcatatat at #gcttaagg   3960 aactggaaaa acaggctctc tctctctcac aaacacacac acacacatac ca #aggtagct   4020 gtcaaaatgt tatccgaaat tttggaacca aaaaatcttg aaagatggta tt #ccaatatc   4080 acattttatg taagttttct attatattag attcaaatta cgattcgagg cc #acaagctt   4140 taagaattca gggccttttt aacttgccaa gccccacacc actccaggaa ct #tccccaca   4200 ccccagttct cagaattcat gtgcaaggtc tttcctaaat ccagggtcca gg #tcagagag   4260 tggaggatgt gctctatttc ttacctgatt gcagacccct ctgacagtgc tc #ccttctga   4320 agcactcact gtctgaacgt acacagtctc agacttaatc atgcacagtg ag #caagactg   4380 tggtgtgata attggcgtcc ctgacttatt agggcaaatc tatgggaggg gg #agacctcc   4440 tggaccactg agcaattaat tcatttacat taggaagttt ctccgtcaga tg #caggaaaa   4500 aaatcttgtt ttcctgctgt ggttttgact tttgccccat cttctgttgc tg #ttgtagga   4560 ggcaaaataa gggtcaaggc ctggaaacac aagtgctttg actgaagctc ca #cttggctt   4620 ccgaagccca agctgggttg taccaggttc cctagggtgc aggctgtggg ca #actgccag   4680 ggacatgtgc ctgcccaccg gcctctggcc ctcactgagt tggccaatgg ga #aatgacaa   4740 ttgtgaggtg gggactgcct gcccccgtga gtaccaggct gttgaggctg gg #ccatctcc   4800 tcctcacttc cattctgact gcagtctgtg gttctgattc cataccagag gg #taagagca   4860 attctgtgaa gttccaggct gggtggggga tgcatgcata gcctctggct gg #gatcaccc   4920 aggctctccc gtccgtagta gtgtgggagt ggatacaggt ggatactctg gt #cagagcag   4980 cactggtgga ggcagatatg cactgggctt cttcctccgt tctcccacag cc #ccaagaga   5040 gaaagggtta tttcagacat tccttctaag atgcatggaa ccattctgaa tt #ttacccag   5100 ttcgctctgt agcaggatac ctattgagaa aaagttaggg tcagtaaggt gg #aagggtct   5160 gtccacagat gaagtccaat tcgattaagg gggataaggg aatacattgt ct #cttagctt   5220 gaccaggtag ggcaaaggaa gaagcatata tgaaggcagc ttcagaaaag tc #aagctgag   5280 cactgacttc agactggaat taggaatcca gctctgccac tttattctac tc #agcaaata   5340 tttactgagc aaattctatg ggctagacag tggattgggt tcacaagata ca #atgagtgt   5400 gacatggttg ttgtctatgg atttggggat atatgtaggt atagggatat ct #tacaaggt   5460 aatcaagagg ttctaatgag gccagccatg gtggctcaca cctgtaatcc ca #gcaatttg   5520 ggagaccgag gcgggtggat cacctgaggt caggagttcc agactagcct ga #ccaacatg   5580 gtgaaacccc gcctctacca aaaatacaaa aattagttgg gcgtgatggc ag #gtgcctgt   5640 aatcccagct tctcgggagg ctgaggcagg agaattgtct gaacctggga gg #cagaggtt   5700 gcagtgagcc gagattgttg ccactgcatt ccagcctggg tgacagagcg ag #actttgtg   5760 tcaaaaaaaa aaaaaaaaag aaagaaaaga aaaagaggct ctaatgagat aa #aatgagaa   5820 aagcctggca tgtagtggca acttatgaaa aattgtaatt aaaaaaaaac at #tttctgac   5880 agaagaaact ggatctacct ggtttttctg aagcctaatc ctgctcgccc ca #gtgagtgc   5940 tgtttctgag gcatcctggt tgttttgagc tgtggatgct gaaggttaga gt #gggaggga   6000 ttttagaggt taggtctgcc cctcttgtgt tagaggacat ggatccctgg tc #tggagagg   6060 ttctggtttt tggatcaagc ctcacaaggg gtggcaccaa ctcactccta gg #aactccgc   6120 tagaaggaag gccagctctg cctaattcgg ttggggagat gggggtccct tt #atgctagc   6180 agaatatgtc cgaaggagca tgatggtgtc agctttgttc atgaaggcca gt #ggtacaca   6240 gggagcccgg cagcttcctc agcagtccct gctgccactc ttccttaagt ct #tgaggagt   6300 ctttttttgg cacaatctca gctcactgca acctccgcct cccaggttca ag #cgattctc   6360 ctgcctcagt ctcccaagta gctgagacta caggcatgcg ccaccacgcc ca #gctaattt   6420 ttatattttt agtagagatg gggttcacca tattggccag gatggtctcg at #ctcttgac   6480 ctcatattcc acctgcctcg gcctcccaaa gtgctggtat tacaggtgtg ag #ccactgcg   6540 cctggccgag gagtcttaag ctgagatcac agcattgcac tccagcctgg gc #aaaaagag   6600 caaaactcca tctcaaaaaa aaaaaaaaaa tagacacaag actggctcct tg #tctttttt   6660 ggggacaggg tctcactcta tcacccaggc tggagtgcag tggtgcaatc ac #agctcact   6720 gcagcctcga tttcccaggc tcaagtgacc ctcccatctt agcctcctga gt #agctggga   6780 ctacaggtgt gtgcaaccat gcctggctaa tttttaaaaa ttttttgtag ag #atgaggtc   6840 tcactatatt ggctgggggg cctcaaactc ctgggctcag cagtcctccc ac #ctcagcct   6900 cccaaaaggc tgggattata tgcttgctct ttttaaggtg gctgtaggga ca #aactttcc   6960 acctactcct tgtcaagcca gtggaccggt ggtcccagac atacggctaa ag #tcaagagg   7020 tgatgtcttt tggagagata ctttcaatca ggaatttcaa tcagaaattc aa #tcatgtgg   7080 agagagactt atcctaaaaa tgtggtggtg cgtgggatgc tctgttttat ta #gttccttg   7140 acagtatgta tgtgtgtgag tgtgtgtgtg tgcgcgcgca cactcatttg ga #tgggtgtg   7200 tatgtgtgtg ggggggtggt gcgtacgtat gtggatgtgt ggatgtggtg tg #tgggtgtg   7260 cgcgtgcata ggtggaggtg tgtgtatggg tgcgggtatg tgtgtgtgtt gg #gcatggag   7320 atattgacag ctctcccagg gctgagtgaa ggctttcggg caaagctcct gg #gagctagg   7380 caaagctgag ttgattcctg gttatgccat ttattattgg gttgcaccgt gt #gaaactgc   7440 caatattcta cactttgact tttatttatt tttattttta ttttttttga ga #cagagttt   7500 cacacttgtc acctaggctg gagtgcagtg gcgcgatctc agctcactgc aa #cctctgcc   7560 tcatggattc aagtgattct cctgcctcag cctcccaagt agctggaatt ac #aggtgccc   7620 gctaccacgc ctgactaatt tttgtatttt tggtagagac gggatttcac ca #tgttgtcc   7680 aggctggtct gaaactcctg acatcaggta atccacccac ctcagcctcc ca #aagtgctg   7740 ggattacagg catgagccac tgcgcccggc ccattttgac ttttaaaaat gg #gagtttga   7800 tataattcaa tccagtggtt gaattagcta gcatcgttcc ctctccaagt ct #caggttct   7860 cctacacgtt agagtcaaaa gcagggctat gggaagatta agtaaaataa at #tttgaaaa   7920 tgccttatga aaattacact ccaaagaact cgcgccagtg tcagtgttct ca #tgttcctc   7980 atctcacatg atcacatttc gcggattagg aagctgagtc tgagaagctc cg #tgtagtgc   8040 tttttcggag gcaccgtgat gtgatggaag gctcactcgt taggaagtca ga #acagagtc   8100 tctgagggat catttcctta atctgtcagt ttcctcatct ctgaagttgg gc #tcatttcc   8160 ttccttcatg gagttattgt aaagatgaag ataaataacg tgtaaaatct ag #catgggaa   8220 ctggcttcta taaggttcta ataagtgcat tcctactcct tcccctcagc ct #tcccattt   8280 gtaaaagcaa ggcaggggtg aggtgatttc tggggctcct tttggctctg ac #atttgagg   8340 attttgtatc cttttttttt tcagagtctt gctctgtcac ccaggttgga gt #gcagctca   8400 atgcaaattc cgcctcccag gctcaagcaa ttcttatgtc tcagcctcct ga #gtacctgg   8460 gattacaggc aggcaccacc acccccagct aattttttgt attttcagta ga #gacggggt   8520 tttgccatat tggccaggct ggtcttgaac tcctgacttc atgtgaccca cc #catctcag   8580 cctcccaaag tgctgagatg acaggtgtga gctaccgtgc ctggccaatt tt #gtgtgctt   8640 taatgccctt ttctgctgga agagttggca ccaggtttgg tgatctcttt cc #cccacacg   8700 gctctgcctc ctgccagtcc cagaggggac cctgtccttg catttcacag ga #ttctgctg   8760 ttgcaactga aattccagta ggtcaaagtg aaatttctca tacactttaa ca #tgaagata   8820 aatgatcaca gtatggccct ttaggatcct gagaacatca cggtcatccc ct #ggtataat   8880 tttaaaagca gatgaatcca tgcctgtgcg aggtttgcca ggaaagccag tg #ctgggatt   8940 acagtggaag tctttttatg ctactttttt cttgtatccc tcaccccatg gg #gtggcata   9000 ttgaaaggca ggatgtgtga ccacgatact tttctcctcc tggactatgt ct #aagagtct   9060 gttattgggt tctgaagatc agagtttaat ttccgactcc tctctgtgta gc #tctgggat   9120 cttggaaagc cacttaacct ttctgaagtc ccctttcctc atctctaaaa tg #catacact   9180 catcactaac atttactgag cactgacatg tgccagacac cattctaagc at #tttacaca   9240 gactacacca tttgatcttc caacaaacag aacactgaaa cgcattacag gt #cagaacaa   9300 atgatttgtg cctaagcacc aagaccgtag agcccgtgct ccctattcta cc #ctatcctg   9360 tctctcaaaa tgattgtgag aatcgaatga gacactaggt gagaaaaggg tt #ttataaat   9420 agcattttaa aaatttttta aagtccacaa aatttttaat tttaatacag at #aaaataga   9480 tccctttgtt ttataaaaag taacaaaatt tgttatacaa caactatgtt at #ttattaat   9540 tttgcctttt tgtatgctgc caggaaagaa acattaagaa atcttaaatt ga #ttatggtg   9600 aatcagaagg tctgcctgga ctttttattg ctctaactgt acagctgatc at #actacctc   9660 attttttttt atgacacttc aagggtgcgc ttagcttcat cactccttcg tt #gccaaaag   9720 ctttgtgacc aaaaacaatt aagcagattc ctgagtcact aaatgacaca ta #accagagt   9780 tgagacttag gaacttttag tgccatgcta agcccacagg gacacaacaa at #agcatttt   9840 acaaaggcaa agaattgtga cacttgagat ttagcttgtt gatccttgta aa #agttttct   9900 ttttaggcat aattgagttt tagatcatag tactcactat tacttagtaa ta #attttttt   9960 ctgatagaaa tacagtgtaa caggccgggc gcagtggctc atgcctgtaa tc #ccagcact  10020 ttgggaggcc gaggcgggcg gatcacttga ggtcaggagt ttgagaccag cc #cggccaac  10080 atggtgaaat cccatctcta ctaaaaatac aaaaaattag ccaggtgtgg tc #gtggattc  10140 ctgtgatccc agctacttgg gagggtgagg caggagcatc agttgaaccc ag #gaggcgga  10200 ggttgcagtg agccaagatg gtgccattgc actccagcct gggccacaaa gc #gagactcc  10260 acttcagaaa caaaaaaaaa aagagagaga gagaaaagaa ggaaggaagg aa #ggaaggaa  10320 ggaaagaagg aaggaaggaa ggaaagaagg aaggaaggaa ggaaggaaag aa #ggaaggaa  10380 ggaaagaagg aaggaaggaa agaaggaagg aaggaaggaa ggaagggtaa ca #agcaaagt  10440 gtaacaatgg caatatctaa aaaaataggt atttttatat gtttgtcgtt tt #atatatat  10500 gacccccact ttagagatga ggaaactgag agattaagga aacgatccct ga #gagactct  10560 gttctgactt ccaaatcggt gagctttcca tcgcatcacg gtgcctccga aa #gcatgaca  10620 cggagcttct cagacttagc ttcctaatcc gctaaacggg attatgtgag at #gaggaaca  10680 tgagaacgct gacatgggtg agggttcctt ggagtatcat tttcatgtgg ca #ttttcaaa  10740 acttatttta cctaatcttc ccaaagccct gcttttgact ctaatgtgtc tc #ctgagact  10800 tggagagcgc aagatgctag cgacagagca agactccatc tccagataaa ta #aataagta  10860 aaataaaaaa gaacacaaat aattttgaaa atttttttga aaattaggca cg #tttgcact  10920 gaccttcaat tgttattaat tgctggtttc ccacccagaa ttaagttgga at #gcaacttt  10980 cttttacaat cagagtccgt tcttggtctt ggaaacttct gaggctcctg tg #ctaatccc  11040 actcttgtat ttttggcacc tctaccccgt gccactgtca tggaacccag gc #tgatcgca  11100 cctattagtg gagaaatmtg tccataatac tgaagtttgg ggacaaacag tg #ttccctta  11160 gggtaggaga aagagatctt tatttttaac aaagggggag gagccagaaa ac #tccagaga  11220 cccctgagtt tgccctctct ccaaggtttg gggtaagccc cccgtcaccc tt #tatctctg  11280 gggctttcac atattctgga ttctctcctc ctgtttccca gcagaaaagg at #ggagcctc  11340 acagattctt cccatttctg gagaaaaaca tgcatggagc tcaaagttct tc #tcaggagt  11400 tttattgcca aagccataat aagaaagggt ggaggtgaca agcagtgagg aa #gtttaaag  11460 atgcatgaaa tctgtaaagt ctcagaacaa gaattctcct aaaatgcaaa ag #gggctttg  11520 ctggtctccc cttggcttct catgtagctc acctcttttt tcttatcttg ag #actagtca  11580 aacctaagct gtttctcatt ttatttccag aagctattga gaacactctc ct #gaattctt  11640 caaattcagt agagggcgac aaatgtacat ataaatgatg gtagtgggtc tt #aaataaag  11700 actcatgaca cctaaagggg cagcacctga gtctgattgc acctgtttct gt #tgctgttt  11760 ctgtctctct tctctctgtc tgccatttca ttatcaatgg ttactttact ta #taagatca  11820 tattagaacc tgatatttga taaatgatgc atcagatcta tagtgagaga aa #aaattaat  11880 gcaattaaag gtgttgtaac agctagtctt caagtgggga gaaatcattt ga #gtacctta  11940 ggtcacagct tacatcaaaa caaaaaatca gagctacatt aaaaagtgaa at #tttaacta  12000 tatcaaacaa tagaaaaaaa cagaagaaaa ttgaatactt actaaatctt ag #catgaata  12060 agaactgttt aacacttaga ggcaaggact gggcgtggtg gctcatgctt tt #aatcccag  12120 gactttggga gcccaaggcg ggcggatcac ctgaggtcag gagtttgaga ct #agcctggc  12180 caacatggtg aaaccccgtc tctactaaaa aatgcaaaaa ttagctgcgt gt #ggtggtgc  12240 atgcctgtaa tctcagctac ttgggaggct aaggcatgag aatcgcttga ac #ctgggagg  12300 tggaggctgt agtgagccga gattgtgcca ctgcactaca gcctgggtga ca #gtgtgaaa  12360 tcctctctct caaaaaaaaa aaaaaaaaaa agcaaactag agcagtgagg ta #ccattatt  12420 tcctttgctc actaaactga caacacacaa atgtttttta taatacccaa ag #ctgatgag  12480 ggtagttaag gtatgccctt ttatacacac actaatgatg tactactggt tg #gcagtata  12540 acatatgctg ccatgtgggg atatgtatca ggagacttaa aaatgtgcat ac #cttttggt  12600 ccagtaattt acttctggga atctgtcata acagaataat aatcttgggg aa #agctacat  12660 gcctaaggat atttaaaata ttatttaaaa atcaaagtat aatttcttac ag #aatataaa  12720 ataatatttt aaaatgaaaa tatgctaaaa gtttgatgaa atataaatgg tc #aaatatat  12780 attgattata tccacttact agactagcac tcactctgag acgttaaaaa ta #gtcattat  12840 aaaaactaga aaatgccaaa gacaaaataa aggaataaag ttttacataa ag #tatgattc  12900 cactatgttt aaaaataaac agagacattc ttggagttga gtattgtttt ct #tttctgtc  12960 atgtccaaag aactatataa ctattatttt taatgaacta tatatgtaat at #atacatat  13020 agtttatatg tatatacaaa atttatctca tatatatgat aaagatgaaa ga #tgagttgg  13080 atgtgccacg tgaagtgggt agtatagaaa cccaggtaat ggggcatagg ag #tgggattc  13140 cagataccag gcccatgttt ttggggtgag attgccaatc acggtctttc tt #ccatccct  13200 cacagaggag taggtttgtc ttcaacaaac cttcagttgt cctgaagaca aa #cctaattc  13260 tggagacttc atataatcta gaagagacaa gcaaactgat gaaaaatagt ga #atttttaa  13320 ggtaaaataa agtacatgga ctacactttg tttagaatca gattcttggg at #taaccaca  13380 ttaacccaca gagggtctta gtgatgcctc taatccagga tcctaggacc ta #tttctctc  13440 tgtgagatgc tttctcccaa ctccttggtg agagtgggaa gactaagacc tc #agcaatct  13500 gaggtggagg cctaagatcc ccctaagatc ggaggcagaa tctgagaggg ga #taaaagtc  13560 cctatacctg tattgggccc ttttctggga gggggatatc aaagaatgat tt #tgagacag  13620 ggaggctttt gactacctgt gccacttgag ctctttgcta gggctccaga at #acatattt  13680 caaatacatt ccccctccct ccttccttcc ctcttccact cttccttttt at #cttccttt  13740 cttcttttcc ttcctccttc ccttcctttc tctggctctc tcatgatttc tt #ttcctcat  13800 tataaaagtg cttatttagt ccctactctg ctattagtgt gttagtcttt gt #cccctggt  13860 acttgctgtt taatggagaa atgggtgagc aaaacagaaa ttacagcaga gt #gcaataat  13920 agagctaagc caggtgtata aatccattct cacactgctg taaaaaacta ct #gggtaatt  13980 tataaagaaa agaggtttaa ttgactcaca gttccacagg ctgtacagga ag #catggctg  14040 gggaggcctc agaaaactta caatcatggt ggaagaaaga gcgaagggga ag #caagcaca  14100 tcacacagca gcaggagaga gagagagaaa gagagagaga gagaatatag gg #gaagtgct  14160 acacactttc aaccagatct tgtgagaatt cacctactat catgagaaca gc #aagggata  14220 agtctgcctc catgattcag tcacctccta ccaggcccct tctccaacac at #gtcgacgt  14280 gctatttggg tggggacaca gacccaaacc atattaccag ggcactggag aa #acacagag  14340 gggaaagaac cagccaagga gtgagatgga gaacaaggag gacttcttga aa #cagatgac  14400 atccaaactg ggtcctgaaa gctgaataga gattagacag gggaggaggg gc #agctaaag  14460 atggctcagg caaacaaagg gccaggggat atgttcatgg gatgatgtgt ct #ctcgttgt  14520 ctgcttaaca caaggtgagt ctctccctcc ctctctctct ctttttctct gt #gtgtgttt  14580 gtgtgtgtgc atgtgtgcaa atgtaatata cccaatagtc aaacatgtgc cc #caggagag  14640 gggtagagga agaaagagaa tgagagagta agaaggagga atagacacag aa #aatgagag  14700 agaagggggg aaagaaaaag aagaaaggag ccagaggaga gaagctggtt ag #cattgaat  14760 ggagcaatct gtgtcatcgt acttgggaaa cccaaggatg gattcttggc aa #gtcgactc  14820 ttggagcttt ccctgtgctt ggtcctgtgc tcagacatgg gaaaattaga gg #agtgtcat  14880 ctgtgcaatc actgaattca taatcttggt gaggaaagga gactacacac ag #ggaataat  14940 gctaagtatt acagatttca gggcagaaag agatcaaggt gggctgcaat at #tcagaaaa  15000 gtcttcctgg aaaagttgaa tacttagaaa gcagctccta gaagtagact ct #gctgagat  15060 ggacggagtc ctttgtaggt cccaactggg tgtgtgtgtg gggtctgtct ct #ccatggcy  15120 gacagtgcac atgtggattc cagggctcag gatgctgttg ctgggagctg tt #ctactgct  15180 attagctctg cccggkcatg accaggaaac cacgactcaa gggcccggag tc #ctgcttcc  15240 cctgcccaag ggggcctgca caggttggat ggcgggcatc ccagggcatc cg #ggccataa  15300 tggggcccca ggccgtgatg gcagagatgg cacccctggt gagaagggtg ag #aaaggaga  15360 tccaggtaag aatgtttctg gcctctttca tcacagacct cctacactga ta #taaactat  15420 atgaagkcat tcattattaa ctaaggccta gacacaggga gaaagcaaag ct #tttttatg  15480 ttaaccataa gcaacctgar gtgatttggg gttggtcttc caaggatgag tg #tagatggt  15540 gcctctataa ccaagacttt ggctttgctg catctgcagc tccttttcca tc #ccctttcc  15600 catcttcacc ctcatcccta ttcccagtac attcatattc tgattcctct tt #ctgtctgc  15660 ttaacttcca tttcacccag tggcattcaa ccacatttac tgcacacccc ct #gaaaggct  15720 cagtcctgcc tttggggaac tcttgatcta ggtaagatgt ctaatgtgca ag #gctctgtt  15780 ggtggttacc acaagaaagt ctactctaaa aatgtcaaac tgaatgtgaa ca #agtattca  15840 aagtatggag catagagaaa atrtactcac cgtggacctg atgaagaatg aa #ggcttcaa  15900 ggaggaggca gagcttcagc taggccttga atgatgggta ggcagaatag ag #gaggagag  15960 acatcctaga tggagggggt agaattgcaa aaccagggtt gatggtgcca gc #acataaag  16020 ggctggcagg gtggagggtc tatgatagag acctatagga gataaagata ga #gttgaaat  16080 tatgggagcc tccatgtctg tgggagatat agaaggagga ggtaacacct ct #ctcctttt  16140 gggagctctt attggtttct tgatctataa gtcaagaagg ttgtgagtgg ga #gccacagg  16200 gatggtaatt taggctgtaa ccaacctagg caggagttct gttctttgta gt #cactgagg  16260 tcttctcatt ccttaggtct tattggtcct aagggagaca tcggtgaaac cg #gagtaccc  16320 ggggctgaag gtccccgagg ctttccggga atccaaggca ggaaaggaga ac #ctggagaa  16380 ggtgcctatg tataccgctc agcattcagt gtgggattgg agacttacgt ta #ctatcccc  16440 aacatgccca ttcgctttac caagatcttc tacaatcagc aaaaccacta tg #atggctcc  16500 actggtaaat tccactgcaa cattcctggg ctgtactact ttgcctacca ca #tcacagtc  16560 tatatgaagg atgtgaaggt cagcctcttc aagaaggaca aggctatgct ct #tcacctat  16620 gatcagtacc aggaaaataa tgtggaccag gcctccggct ctgtgctcct gc #atctggag  16680 gtgggcgacc aagtctggct ccaggtgtat ggggaaggag agcgtaatgg ac #tctatgct  16740 gataatgaca atgactccac cttcacaggc tttcttctct accatgacac ca #actgatca  16800 ccactaactc agagcctcct ccaggccaaa cagccccaaa gtcaattaaa gg #ctttcagt  16860 acggttagga agttgattat tatttagttg gaggccttta gatattattc at #tcatttac  16920 tcattcattt attcattcat tcatcaagta actttaaaaa aatcatatgc ta #tgttccca  16980 gtcctgggga gcttcacaaa catgaccaga taactgacta gaaagaagta gt #tgacagtg  17040 ctattttgtg cccactgtct ctcctgatgc tcatatcaat cctataaggc ac #agggaaca  17100 agcattctcc tgtttttaca gattgtatcc tgaggctgag agagttaagt ga #atgtctaa  17160 ggtcacacaa gtattaagtg acagtgctag aaatcaaacc cagagctgtg ga #ctttgttc  17220 actagactgt gcccttttat agaggtacat gttctctttg gagtgttggt ag #gtgtctgt  17280 ttcccacctc acctgagagc cattgaattt gccttcctca tgaattaaaa cc #tcccccaa  17340 gcagagcttc ctcagagaaa gtggttctat gatgaagtcc tgtcttggaa gg #actactac  17400 tcaatggccc ctgcactact ctacttcctc ttacctatgt cccttctcat gc #ctttccct  17460 ccaacgggga aagccaactc catctctaag tgctgaactc atccctgttc ct #caaggcca  17520 cctggccagg agcttctctg atgtgatatc cacttttttt tttttttgag at #ggagtctc  17580 actctgtcac ccaggctgga gtacagtgac acgacctcgg ctcactgcag cc #tccttctc  17640 ctgggtccaa gcaattattg tgcctcagcc tcccgagtag ctgagacttc ag #gtgcattc  17700 caccacacat ggctaatttt tgtattttta gtagaaatgg ggtttcgtca tg #ttggccag  17760 gctggtctcg aactcctggc ctaggtgatc cacccgcctc gacctcccaa ag #tgctggga  17820 ttacaggcat gagccaccat gcccagtcga tatctcactt tttattttgc ca #tggatgag  17880 agtcctgggt gtgaggaaca cctcccacca ggctagaggc aactgcccag ga #aggactgt  17940 gcttccgtca cctctaaatc ccttgcagat ccttgataaa tgcctcatga ag #accaatct  18000 cttgaatccc gtatctaccc agaattaact ccattccagt ctctgcatgt aa #tcagtttt  18060 atccacagaa acattttcat tttaggaaat ccctggtttt aagtatcaat cc #ttgttcag  18120 ctggacaata tgaatctttt ccactgaagt tagggatgac tgtgattttc ag #aacacgtc  18180 cagaattttt catcaagaag gtagcttgag cctgaaatgc aaaacccatg ga #ggaattct  18240 gaagccattg tctccttgag taccaacagg gtcagggaag actgggcctc ct #gaatttat  18300 tattgttctt taagaattac aggttgaggt agttgatggt ggtaaacatt ct #ctcaggag  18360 acaataactc cagtgatgtt cttcaaagat tttagcaaaa acagagtaaa ta #gcattctc  18420 tatcaatata taaatttaaa aaactatctt tttgcttaca gttttaaatc ct #gaacaatt  18480 ctctcttaca tgtgtattgc taatcattaa ggtattattt tttccacata ta #aagctttg  18540 tctttttgtt gttgttgttg tttttaagat ggagtttccc tctgttgcca gg #ctagagtg  18600 cagtggcatg atctcggctt actgcaacct ttgcctccca ggttcaagcg at #tcttctgc  18660 ctcagcctcc cgagtagctg ggaccacagg tgcctaccac catgccaggc ta #atttttgt  18720 atttttagta aagacagggt ttcaccatat tggccaggct ggtctcgaac tc #ctgacctt  18780 gtgatctgcc cacctccatt tttgttgtta ttttttgaga aagatagata tg #aggtttag  18840 agagggatga agaggtgaga gtaagccttg tgttagtcag aactctgtgt tg #tgaatgtc  18900 attcacaaca gaaaacccaa aatattatgc aaactactgt aagcaagaaa aa #taaaggaa  18960 aaatggaaac atttattcct ttgcataata gaaattacca gagttgttct gt #ctttagat  19020 aaggtttgaa ccaaagctca aaacaatcaa gacccttttc tgtatgtcct tc #tgttctgc  19080 cttccgcagt gtaggcttta ccctcaggtg ctacacagta tagttctagg gt #ttccctcc  19140 cgatatcaaa aagactgtgg cctgcccagc tctcgtatcc ccaagccaca cc #atctggct  19200 aaatggacat catgttttct ggtgatgccc aaagaggaga gaggaagctc tc #tttcccag  19260 atgccccagc aagtgtaacc ttgcatctca ttgctctggc tgagttgtgt gc #ctgtttct  19320 gaccaatcac tgagtcagga ggatgaaata ttcatattga cttaattgca gc #ttaagtta  19380 ggggtatgta gaggtatttt ccctaaagca aaattgggac actgttatca ga #aataggag  19440 agtggatgat agatgcaaaa taatacctgt ccacaacaaa ctcttaatgc tg #tgtttgag  19500 ctttcatgag tttcccagag agacatagct ggaaaattcc tattgatttt ct #ctaaaatt  19560 tcaacaagta gctaaagtct ggctatgctc acagtctcac atctggttgg gg #tgggctcc  19620 ttacagaaca cgctttcaca gttaccctaa actctctggg gcagggttat tc #ctttgtgg  19680 aaccagaggc acagagagag tcaactgagg ccaaaagagg cctgagagaa ac #tgaggtca  19740 agatttcagg attaatggtc ctgtgatgct ttgaagtaca attgtggatt tg #tccaattc  19800 tctttagttc tgtcagcttt tgcttcatat attttagcgc tctattatta ga #tatataca  19860 tgtttagtat tatgtcttat tggtgcattt actctcttat cattatgtaa tg #tccttctt  19920 tatctgtgat aattttctgt gttctgaagt ctactttgtc taaaaataac at #acgcactc  19980 aacttccttt tctttcttcc ttcctttctt tcttccttcc tttctttctc tc #tctctctc  20040 tttccttcct tccttcctcc ttttctttct ctctctctct ctctctcttt tt #ttgacaga  20100 ctctcgttct gtggccctgg ctggagttca gtggtgtgat cttggctcac tg #ctacctct  20160 accatgagca attctcctgc ctcagcctcc caagtagctg gaactacagg ct #catgccac  20220 tgcgcccagc taatttttgt atttttcgta gagacggggt ttcaccacat tc #gtcaggtt  20280 ggtttcaaac tcctgacttt gtgatccacc cgcctcggcc tcccaaagtg ct #gggattac  20340 aggcatgagc catcacacct ggtcaacttt cttttgatta gtgtttttgt gg #tatatctt  20400 tttccatcat gttactttaa atatatctat attattgtat ttaaaatgtg tt #tcttacag  20460 actgcatgta gttgggtata atttttatcc agtctaaaaa tatctgtctt tt #aattggtg  20520 tttagacaat ttatatttaa taaaattgtt gaatttaaga tggatgactg tt #ttatttgt  20580 ttgctgttca ccacttctgt tttattctct ttccagaatt cttttggatt gt #ttaaatat  20640 ttcataatat tttatcttaa tttatttatt gggtatttgc ctatatctct tt #gtggtatt  20700 ttttagtggt tgcttgaggg attacaatgt acttaacttt tcacagtgtg ca #taaagtta  20760 atattttgcc acttgcagta aaccgtagaa ggcttataat catattagta cc #tctatcca  20820 ctttctttta tgttgtagtt gtcatatata ttacatctat atacactgaa ac #attatagg  20880 caatgttatg atttttgcat tcgtcagtca tatatatatt ttaaagaatt ta #agaggaga  20940 aaaatacata ttcagatatt catcat           #                   #           20966 

What is claimed is:
 1. An isolated polypeptide fragment of a full length OBG3 polypeptide, wherein said polypeptide fragment sequence consists of consecutive amino acids 101 to 244 of SEQ ID NO:
 6. 2. A composition comprising a carrier and a polypeptide fragment sequence consisting of consecutive amino acids 101 to 244 of SEQ ID NO:
 6. 